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Abstract 

During  the  period  of  7/1/2007  -  12/31/2007,  we  performed  the  following  studies  on  radar 
sensor  network: 

1.  Network-enabled  Electronic  Warfare  (NEW)  for  Collaborative  Automatic  Target  Recog¬ 
nition  (CATR); 

2.  Foliage  clutter  modeling  using  narrowband  and  UWB  radars; 

3.  A  propagation  Environment  Modeling  in  Foliage  using  UWB  radars; 

4.  Target  detection  in  foliage  using  short-time  Fourier  transform  and  UWB  radar  sensor 
networks; 

5.  Some  experimental  studies  on  path  loss  models  for  wireless  sensor  networks  based  on  Xbow 
motes, 

6.  Theoretical  studies  on  distributed  connected  dominating  set  construction  in  random  geo¬ 
metric  k-Disk  graphs  for  potential  application  to  real  sensor  networks. 


1  NEW-CATR:  Network-enabled  Electronic  Warfare  for  Collab¬ 
orative  Automatic  Target  Recognition 

Network-enabled  Electronic  Warfare  (NEW)  is  to  develop  modeling  and  simulation  effort  to  explore 
the  advantages  and  limitations  of  network-enabled  electronic  warfare  concepts.  The  advantages  of 
linking  multiple  electronic  support  measures  (ESM)  and  electronic  attack  (EA)  assets  to  achieve 
improved  capabilities  across  a  networked  battle  force  have  yet  to  be  quantified.  In  [1],  we  utilized 
radar  sensors  as  ESM  and  EA  assets  to  demonstrate  the  advantages  of  NEW  in  Collaborative  Au¬ 
tomatic  Target  Recognition  (CATR).  Signal  (waveform)  design  for  Radar  Sensor  Networks  (RSN) 
in  NEW  is  studied  theoretically.  The  conditions  for  waveform  coexistence  and  the  interferences 
among  waveforms  in  RSN  are  analyzed.  We  applied  the  NEW  to  CATR  via  waveform  diversity 
combining  and  propose  maximum-likelihood  (ML)-ATR  algorithms  for  non-fluctuating  targets  as 
well  as  fluctuating  targets.  Simulation  results  indicate  that  our  NEW-CATR  performs  much  bet¬ 
ter  than  the  single  sensor-based  ATR  algorithm  for  non-fluctuating  targets  and  fluctuating  targets. 
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Real  world  application  example,  sense-through-foliage  target  detection  using  UWB  radars,  was  also 
used  to  validate  our  algorithms. 

2  Foliage  Clutter  Modeling  Using  Narrowband  and  UWB  Radars 

In  [2],  we  studied  foliage  clutter  modeling  on  a  basis  of  both  pragmatically  narrowband  and  ultra¬ 
wide  band  (UWB)  radars.  We  proposed  that  the  foliage  clutter  follows  log-logistic  model  using 
maximum  likelihood  (ML)  parameter  estimation  as  well  as  the  root  mean  square  error  (RMSE)  on 
PDF  curves  between  original  clutter  and  statistical  model  data.  In  addition  to  investigating  the 
log-logistic  model,  we  also  compared  it  with  other  popular  clutter  models,  namely  log-normal  and 
Weibull.  We  showed  that  the  log-logistic  model  not  only  achieves  the  smallest  standard  deviation 
(STD)  error  on  estimated  model  parameters,  but  also  has  the  best  goodness-of-fit  and  smallest 
RMSE. 

3  A  Propagation  Environment  Modeling  in  Foliage  Using  UWB 
Radar 

In  [3],  we  proposed  that  foliage  clutter  follows  log-logistic  model  using  maximum  likelihood  (ML) 
parameter  estimation  as  well  as  the  root  mean  square  error  (RMSE)  on  PDF  curves  between  original 
clutter  and  statistical  model  data.  We  not  only  investigated  log-logistic  model,  but  compared  it 
with  other  popular  clutter  models,  namely  log-normal,  Weibull  and  Nakagami.  It  showed  that  the 
log-logistic  model  not  only  achieves  the  small-  est  standard  divination  (STD)  error  on  estimated 
model  parameters,  but  also  the  best  goodness-of-fit  and  smallest  RMSE  for  both  poor  and  good 
clutter  signals. 

4  Target  Detection  in  Foliage  Using  Short-Time  Fourier  Trans¬ 
form  and  UWB  Radar  Sensor  Networks 

In  [4],  we  studied  sense-through- foliage  target  detection.  When  radar  echoes  are  in  good  quality, 
the  detection  of  target  can  be  achieved  by  applying  short  time  Fourier  transform  (STFT)  to  the 
received  UWB  radar  waveform.  We  compared  our  approach  in  case  of  no  target  as  well  as  with 
target  against  the  scheme  in  which  2-D  image  was  created  via  adding  voltages  with  the  appropriate 
time  offset.  Results  show  that  our  approach  can  detect  target  more  easily.  When  radar  echoes  are 
in  poor  condition  and  single  radar  is  unable  to  carry  out  the  detection,  we  employed  both  Radar 
Sensor  Networks  (RSN)  and  R  AKE  structure  to  combine  the  echoes  from  different  radar  members 
and  finally  detected  the  target. 

5  Experimental  Path  Loss  Models  for  Wireless  Sensor  Networks 

Energy  conservation  is  critical  in  Wireless  Sensor  Networks.  Replacing  or  recharging  batteries  is 
not  an  option  for  sensors  deployed  in  hostile  environments.  Generally  communication  electronics 
in  the  sensor  utilizes  most  energy.  In  [5],  we  studied  the  effect  of  changing  the  transmission  power 
and  baud  rate  on  transmission  distance.  Using  Shannon  channel  capacity  formula  and  LogDistance 
Path  Loss  Model,  transmission  distance  is  shown  to  be  related  to  transmit  power  and  baud  rate. 
Extensive  empirical  readings  were  taken  to  confirm  the  above  relation.  The  path  loss  exponent  got 
as  a  result  of  data  fitting  is  within  the  acceptable  range  for  wireless  environment.  Using  the  equation 
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derived  in  [5],  the  distance  between  neighboring  motes  and  traffic  density,  it  will  be  possible  for 
sensors  to  adjust  their  transmit  power  and  baud  rate  so  as  to  use  only  the  required  amount  of 
energy  to  maintain  the  wireless  link  to  the  neighbor  and  conserve  power. 

6  Distributed  Connected  Dominating  Set  Construction  in  Ran¬ 
dom  Geometric  k-Disk  Graphs 

In  [6],  we  studied  the  problem  of  minimum  connected  dominating  set  in  random  geometric  k-disk 
graphs.  This  research  is  motivated  by  the  problem  of  virtual  backbone  construction  in  wireless  ad 
hoc  and  sensor  networks,  where  the  coverage  area  of  nodes  are  disks  with  different  radii.  We  derived 
the  size  relationship  of  any  maximal  independent  set  and  minimum  connected  dominating  set  in 
geometric  k-disk  graphs,  and  applied  it  to  analyze  the  performances  of  two  distributed  connected 
dominating  set  algorithms  we  proposed  in  [6].  These  algorithms  have  bounded  performance  ratio 
and  low  communication  overhead,  and  therefore  have  the  potential  to  be  applied  in  real  ad  hoc  and 
sensor  networks 


References 

[1]  Q.  Liang,  X.  Cheng,  S.  Samn,  “NEW-CATR:  Network-enabled  Electronic  Warfare  for  Collabo¬ 
rative  Automatic  Target  Recognition,”  submitted  to  IEEE  Trans  on  Aerospace  and  Electronic 
Systems. 

[2]  J.  Liang,  Q.  Liang,  “Foliage  Clutter  Modeling  Using  Narrowband  and  UWB  Radars,”  submitted 
to  IEEE  Globecom,  New  Orleans,  LA,  2008. 

[3]  J.  Liang,  Q.  Liang,  S.  Samn,  “A  Propagation  Environment  Modeling  in  Foliage,”  submitted  to 
IEEE  Trans  on  Antenna  and  Propagation. 

[4]  J.  Liang,  Q.  Liang,  “Target  Detection  in  Foliage  Using  Short-Time  Fourier  Transform  and  UWB 
Radar  Sensor  Networks,”  submitted  to  IEEE  Globecom ,  New  Orleans,  LA,  2008. 

[5]  R.  Sawant,  Q.  Liang,  D.  Popa,  and  F.  Lewis,  “Experimental  Path  Loss  Models  for  Wireless 
Sensor  Networks,”  presented  at  IEEE  MILCOM,  Orlando,  FL,  Oct  2007. 

[6]  D.  Chen,  K.  Xing,  Q.  Liang,  and  E.  Park,  “Distributed  Connected  Dominating  Set  Construc¬ 
tion  in  Random  Geometric  k-Disk  Graphs,”  accepted  by  IEEE  International  Conference  on 
Distributed  Computing  Systems  ( ICDCS '),  Beijing,  China,  June  2008. 


3 


NEW-CATR:  Network-enabled  Electronic  Warfare  for 


Collaborative  Automatic  Target  Recognition 


Qilian  Liang 


Xiuzhen  Cheng 


Sherwood  W.  Samn 


Dept  of  Electrical  Engineering  Dept  of  Computer  Science  Air  Force  Research  Laboratory /HEX 


University  of  Texas  at  Arlington  George  Washington  University 


Brooks  City  Base 


Arlington,  TX  76019-0016,  USA  Washington,  DC  20052,  USA  San  Antonio,  TX  78235,  USA 


E-mail:  liang@uta.edu 


E-mail:  cheng@gwu.edu  E-mail:  Sherwood.samn@brooks.af.mil 


Abstract 


Network-enabled  Electronic  Warfare  (NEW)  is  to  develop  modeling  and  simulation  effort 
to  explore  the  advantages  and  limitations  of  network-enabled  electronic  warfare  concepts.  The 
advantages  of  linking  multiple  electronic  support  measures  (ESM)  and  electronic  attack  (EA) 
assets  to  achieve  improved  capabilities  across  a  networked  battle  force  have  yet  to  be  quantified. 
In  this  paper,  we  utilize  radar  sensors  as  ESM  and  EA  assets  to  demonstrate  the  advantages 
of  NEW  in  Collaborative  Automatic  Target  Recognition  (CATR).  Signal  (waveform)  design  for 
Radar  Sensor  Networks  (RSN)  in  NEW  is  studied  theoretically.  The  conditions  for  waveform 
coexistence  and  the  interferences  among  waveforms  in  RSN  are  analyzed.  We  apply  the  NEW 
to  CATR  via  waveform  diversity  combining  and  propose  maximum-likelihood  (ML)-ATR  al¬ 
gorithms  for  non-fluctuating  targets  as  well  as  fluctuating  targets.  Simulation  results  indicate 
that  our  NEW-CATR  performs  much  better  than  the  single  sensor-based  ATR  algorithm  for 
non-fluctuating  targets  and  fluctuating  targets. 

Index  Terms  :  Network-enabled  electronic  warfare,  radar  sensor  networks,  waveform  diversity, 
collaborative  automatic  target  recognition,  maximum-likelihood,  interferences. 
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1  Introduction  and  Motivation 


In  current  and  future  military  operational  environments  such  as  Global  War  on  Terrorism  (GWOT) 
and  Maritime  Domain  Awareness  (MDA),  war  fighters  require  technology  that  can  support  their 
information  needs  in  manner  that  is  independent  of  their  location  and  consistent  with  their  level  of 
command  or  responsibility  and  operational  situation.  To  support  this  need,  the  U.S.  Department  of 
Defense  (DoD)  has  developed  the  concept  of  Network  Centric  Warfare  (NCW),  defined  as  “ military 
operations  that  exploit  state-of-the-art  information  and  networking  technology  to  integrate  widely 
dispersed  human  decision  makers,  situational  and  targeting  sensors,  and  forces  and  weapons  into 
a  highly  adaptive,  comprehensive  system  to  achieve  unprecedented  mission  effectiveness ”  [1].  The 
goal  of  electronic  warfare  is  to  control  the  electromagnetic  (EM)  spectrum  by  disrupting  or  denying 
enemy  use  of  the  spectrum  while  ensuring  its  use  by  friendly  forces  [2]. 

Network-enabled  Electronic  Warfare  (NEW)  is  to  develop  modeling  and  simulation  effort  to 
explore  the  advantages  and  limitations  of  network-enabled  electronic  warfare  concepts.  The  advan¬ 
tages  of  linking  multiple  electronic  support  measures  (ESM)  and  electronic  attack  (EA)  assets  to 
achieve  improved  capabilities  across  a  networked  battle  force  have  yet  to  be  quantified  [2].  In  this 
paper,  we  utilize  radar  sensors  as  ESM  and  EA  assets  to  demonstrate  the  advantages  of  NEW  in 
Collaborative  Automatic  Target  Recognition  (CATR).  The  network  of  radar  sensors  should  operate 
with  multiple  goals  managed  by  an  intelligent  platform  network  that  can  manage  the  dynamics  of 
each  radar  to  meet  the  common  goals  of  the  platform,  rather  than  each  radar  operates  as  an  inde¬ 
pendent  system.  Therefore,  it  is  significant  to  perform  signal  design  and  processing  and  networking 
cooperatively  within  and  between  platforms  of  radar  sensors  and  their  communication  modules. 
This  need  is  also  testified  by  the  recent  solicitation  from  the  U.S.  Office  of  Naval  Research  [2]  [3]. 
For  example,  in  [3],  it  is  stated  that  “Algorithms  are  sought  for  fused,  and/or,  coherent  cross¬ 
platform  Radio  Frequency  (RF)  sensing.  The  focus  of  this  effort  is  to  improve  surveillance  utilizing 
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a  network,  not  fusion  of  disparate  sensor  products.  The  algorithms  should  be  capable  of  utilizing 
RF  returns  from  multiple  aspects  in  a  time-coordinated  sensor  network.”  In  this  paper,  we  study 
waveform  design  and  diversity  algorithms  for  radar  sensor  networks.  Waveform  diversity  is  the 
technology  that  allows  one  or  more  sensors  on  board  a  platform  to  automatically  change  operating 
parameters,  e.g.,  frequency,  gain  pattern,  and  pulse  repetition  frequency  (PRF),  to  meet  the  vary¬ 
ing  environments.  It  has  long  been  recognized  that  judicious  use  of  properly  designed  waveforms, 
coupled  with  advanced  receiver  strategies,  is  fundamental  to  fully  utilizing  the  capacity  of  the  elec¬ 
tromagnetic  spectrum.  However,  it  is  the  relatively  recent  advances  in  hardware  technology  that 
are  enabling  a  much  wider  range  of  design  freedoms  to  be  explored.  As  a  result,  there  are  emerging 
and  compelling  changes  in  system  requirements  such  as  more  efficient  spectrum  usage,  higher  sensi¬ 
tivities,  greater  information  content,  improved  robustness  to  errors,  reduced  interference  emissions, 
etc.  The  combination  of  these  changes  is  fueling  a  worldwide  interest  in  the  subject  of  waveform 
design  and  the  use  of  waveform  diversity  techniques. 

In  the  existing  works  on  waveform  design  and  selection,  Fitzgerald  [8]  demonstrated  the  in¬ 
appropriateness  of  selection  of  waveforms  based  on  measurement  quality  alone:  the  interaction 
between  the  measurement  and  the  track  can  be  indirect,  but  must  be  accounted  for.  Bell  [6] 
used  information  theory  to  design  radar  waveforms  for  the  measurement  of  extended  radar  targets 
exhibiting  resonance  phenomena.  In  [5],  the  singularity  expansion  method  was  used  to  design  dis¬ 
criminant  waveforms  such  as  K-pulse,  E-pulse,  and  S-pulse.  Sowelam  and  Tewfik  [24]  developed  a 
signal  selection  strategy  for  radar  target  classification,  and  a  sequential  classification  procedure  was 
proposed  to  minimize  the  average  number  of  necessary  signal  transmissions.  Intelligent  waveform 
selection  was  studied  in  [4] [12] ,  but  the  effect  of  Doppler  shift  was  not  considered.  In  [16],  time- 
frequency- based  generalized  chirps  were  used  as  waveform  for  detection  and  estimation.  In  [15],  the 
performance  of  constant  frequency  (CF)  and  linear  frequency  modulated  (LFM)  waveform  fusion 
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from  the  standpoint  of  the  whole  system  was  studied,  but  the  effect  of  clutter  was  not  considered. 
In  [23],  CF  and  LFM  waveforms  were  studied  for  a  sonar  system,  but  it  was  assumed  that  the  sensor 
is  non-intelligent  (i.e.,  waveform  can’t  be  selected  adaptively).  All  the  above  studies  and  design 
methods  focused  on  the  waveform  design  or  selection  for  a  single  active  radar  or  sensor.  In  [21], 
cross-correlation  properties  of  two  radars  were  briefly  mentioned  and  the  binary  coded  pulses  using 
simulated  annealing  [7]  are  highlighted.  However,  the  cross-correlation  of  two  binary  sequences 
such  as  binary  coded  pulses  (e.g.  Barker  sequence)  is  much  easier  to  study  than  that  of  two  analog 
radar  waveforms.  In  this  paper,  we  focus  on  the  waveform  diversity  and  design  for  radar  sensor 
networks  using  the  constant  frequency  (CF)  pulse  waveform.  Relative  to  previous  work,  this  paper 
has  the  following  novelties: 

1.  A  networked  scenario  of  radar  sensors  is  considered  instead  of  a  single  radar  system. 

2.  We  study  waveform  design  and  diversity  for  radar  sensors  networks.  In  Space-Time  Adaptive 
Processing  (STAP)  [18],  the  waveform  (pulse)  design  is  essentially  for  a  single  radar  sensor. 
The  pulse  is  sent  repeatedly  at  different  time  and  echo  is  received  and  processed  by  the  antenna 
array,  and  no  interferences  among  pulses  if  the  pulse  repetition  interval  is  large  enough. 

3.  We  investigate  collaborative  automatic  target  recognition  using  radar  sensor  networks  and 
compare  it  against  single  radar  system  in  CATR. 

4.  Simulations  are  performed  for  nonfluctuating  targets  as  well  as  fluctuating  targets,  and  a  real 
world  application  example,  sense-through-foliage  target  detection,  is  presented. 

The  rest  of  this  paper  is  organized  as  follows.  In  Section  2,  we  study  the  coexistence  of  radar 
waveforms.  In  Section  3,  we  analyze  the  interferences  among  radar  waveforms.  In  Section  4,  we 
propose  a  RAKE  structure  for  waveform  diversity  combining  and  present  maximum-likelihood  (ML) 
algorithms  for  CATR.  In  Section  5,  we  provide  simulation  results  on  ML-CATR.  In  Section  6,  we 
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conclude  this  paper  and  discuss  future  research. 


2  Co-existence  of  Radar  Waveforms 


In  radar  sensor  networks  (RSN),  radar  sensors  interfere  with  each  other  and  the  signal-to-interference- 
ratio  may  be  very  low  if  the  waveforms  are  not  properly  designed.  In  this  paper,  we  introduce 
orthogonality  as  one  criterion  for  waveform  design  in  RSN  to  make  radars  coexistence.  In  addition, 
since  the  radar  channel  is  narrow-band,  we  will  also  consider  the  bandwidth  constraint. 

In  our  radar  sensor  networks,  we  choose  the  CF  pulse  waveform,  which  can  be  defined  as 

x(t)  =  exp(j2ir0t)  -  T/2  <  t  <  T/2  (1) 


where  (5  is  the  RF  carrier  frequency  in  radians  per  second.  In  radar,  ambiguity  function  (AF)  is  an 
analytical  tool  for  waveform  design  and  analysis,  which  succinctly  characterizes  the  behavior  of  a 
waveform  paired  with  its  matched  filter.  The  ambiguity  function  is  useful  for  examining  resolution, 
side  lobe  behavior,  and  ambiguities  in  both  range  and  Doppler  for  a  given  waveform  [18].  For  a 
single  radar,  the  matched  filter  for  waveform  x{t)  is  x*(— t),  and  the  ambiguity  function  of  CF  pulse 
waveform  is 


I  rT/ 2 

A(t,Fd)  =  \  %(t)  exp  (j2nFDs)x*(t  —  r)dt 

|  J —T  /2+r 


I  E sin[nFo(T  —  |r|)] 
TnFD 

We  can  simplify  this  AF  in  the  following  three  special  cases: 


—  T  <  r  <  T 


(2) 


1.  When  r  =  0, 


2.  when  Fjj  =  0, 


A(0,FD) 


E  sm(n  FpT) 
Tn(FD) 


E(T-\t\) 

T 


(3) 

(4) 
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3.  and  when  r  =  Fq  =  0, 


-4(0, 0)  =  E 


(5) 


Note  that  the  above  ambiguity  is  for  one  radar  only  (no  coexisting  radar). 

For  radar  sensor  networks,  the  waveforms  from  different  radars  interfere  with  each  other.  We 
choose  the  waveform  for  radar  z  as 


Xi{t) 


exp[j27r(/3  +  <5j)t] 


—  T/2  <t<  T/2 


(6) 


which  means  that  there  is  a  frequency  shift  St  for  radar  i.  To  minimize  the  interference  from  one 
waveform  to  another,  optimal  values  for  Si  should  be  determined  to  make  the  waveforms  orthogonal 
to  each  other,  i.e.,  let  the  cross-correlation  between  and  xn(t)  be  0, 
f T /2  £  fT I2 

/  Xi(t)x*n(t)dt  =  —  /  exp[j2n(P  +  6i)t]exp[-j2n(/3  +  6n)t]dt  (7) 

J-T/2  1  J —T/2 

=  £sinc[7r(<5i  -  8n)T]  (8) 


If  we  choose 


(9) 


where  i  is  a  dummy  index,  (8)  can  be  written  as 


[  E  i  =  n 
0  i^n 


(10) 


Therefore  choosing  Si  =  ^  in  (6)  yields  orthogonal  waveforms,  i.e.,  the  waveforms  can  coexist  if  the 
carrier  spacing  is  a  multiple  of  1/T  between  two  radar  waveforms.  In  other  words,  orthogonality 
amongst  carriers  can  be  achieved  by  separating  the  carriers  by  a  multiple  of  the  inverse  of  waveform 
pulse  duration.  With  this  design,  all  the  orthogonal  waveforms  can  work  simultaneously.  However, 
there  may  exist  time  delay  and  Doppler  shift  ambiguity  which  may  interfere  with  other  waveforms 
in  RSN. 
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3  Interferences  of  Waveforms  In  Radar  Sensor  Networks 


3.1  RSN  with  Two  Radar  Sensors 


We  are  interested  in  analyzing  the  interference  from  one  radar  to  another  if  there  exist  time  delay 
and  Doppler  shift.  For  a  simple  case  where  there  are  two  radar  sensors  ( i  and  n),  the  ambiguity 
function  of  radar  i  (considering  the  interference  from  radar  n)  is 


I  r°° 

Ai{ti,tn,  Fd{,  Fd„)  =  /  [xi(t)exp(j2nFDit)  +  xn(t  -  tn)exp(j2irFDnt)]x*(t  -  tjdt 

I  •/  —  OO 

rT/2+min(ti,tn) 

/  xn(t  -  tn)  exp (j 2tt FDn t)x* (t  -  U)dt 

J-~ 


(11) 


< 


+ 


T  /2+ma  x(ti,tn) 
•T/2 


rl/’Z 

/  Xi(t)  exp  (j2nFDit)x*(t  -  ti)dt 

J-T/2+U 


T/2+t 
T/2+min(ti,tn) 


(12) 


+ 


/  xn(t  -  tn)  exp(j2nFDnt)x*{t  -  U)dt 

*/ — T/2+max(tj,tn) 

E sin[K FDi(T  -  1 1, 


(13) 


TnFDi 

To  make  the  analysis  easier,  it  is  generally  assumed  that  the  radar  sensor  platform  has  access  to  the 
Global  Positioning  Service  (GPS)  and  the  Inertial  Navigation  Unit  (INU)  timing  and  navigation 
data  [3].  In  this  paper,  we  assume  that  the  radar  sensors  are  synchronized  and  that  tj  =  tn  =  r. 
Then  (13)  can  be  simplified  as 

Esin[nFDi{T  -  |r|)] 


Ai{T,FDi,FDn)  «  |£'sinc[7r(n  -  i  +  FDnT)]\  + 

We  have  the  following  three  special  cases: 

1.  If  F[)t  =  F[)n  =  0,  and  and  Sn  follow  (9),  (14)  becomes 

£(T-M)] 


TnFDi 


(14) 


Ai{r,  0, 0) 

2.  If  r  =  0,  (14)  becomes 

Ai(0,FDi,FDn)  ~  |^sinc[7r(n  -  i  +  FDnT)]\  + 


(15) 


E  sin(7 xFd{T) 


TnFDi 


(16) 
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3.  If  Fui  =  Fon  =  0,  r  =  0,  and  Si  and  Sn  follow  (9),  (14)  becomes 


4(0, 0,0)  «£ 


(17) 


3.2  RSN  with  M  Radar  Sensors 


Our  analysis  on  an  RSN  with  two  radar  sensors  can  be  extended  to  the  case  of  M  radars.  Assuming 
that  the  time  delay  r  for  each  radar  is  the  same,  then  the  ambiguity  function  of  radar  1  (considering 

interferences  from  all  the  other  M  —  1  radars  with  CF  pulse  waveforms)  can  be  expressed  as 

M 

A\{t,FDi  ,  •  •  •  ,FDm )  ss  |Esinc[7r(z  -  1  +  FDiT)\ \  + 

i= 2 

Similarly,  we  have  the  following  three  special  cases: 


E sin[7r FDl{T  -  |r|)j 


TnFD. 


(18) 


1-  Fdi  =  Fo2  =  •  ••  =  Fdm  =  0,  and  the  frequency  shift  Si  in  (6)  for  each  radar  follows  (9), 


then  (18)  becomes 


Ai  (r,  0, 0,  •  •  •  ,0)  « 


a(r-H)] 

T 


(19) 


Comparing  it  against  (4),  we  notice  that  a  radar  may  exist  that  can  get  the  same  signal 
strength  as  that  of  the  single  radar  in  a  single  radar  system  (no  coexisting  radar)  when  the 
Doppler  shift  is  0. 


2.  If  r  =  0,  then  (18)  becomes 

Ai(0,FDi,FD2,  •  •  •  ,FDm)  %  |£sinc[7r(2  -  1  4-  FDiT)]\  + 


M 


i= 2 


E  sin(7r  Fd^T) 


TitFd! 


(20) 


Comparing  to  (3),  a  radar  in  RSN  has  higher  interferences  when  unknown  Doppler  shifts 
exist. 


3.  Fpi  =  Fd2  =  •  •  •  =  Fqm  =  0,  r  =  0,  and  Si  in  (6)  follows  (9),  then  (18)  becomes 

j4i(0,0,0,---  ,0)  *E  (21) 
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4  NEW  for  Collaborative  Automatic  Target  Recognition 


In  NEW,  the  radar  sensors  are  networked  together  in  an  ad  hoc  fashion.  They  do  not  rely  on  a 
preexisting  fixed  infrastructure,  such  as  a  wireless  backbone  network  or  a  base  station.  They  are  self¬ 
organizing  entities  that  are  deployed  on  demand  in  support  of  various  events  surveillance,  battlefield, 
disaster  relief,  search  and  rescue,  etc.  Scalability  concern  suggests  a  hierarchical  organization 
of  radar  sensor  networks  with  the  lowest  level  in  the  hierarchy  being  a  cluster.  As  argued  in 
[9]  [10]  [13]  [17],  in  addition  to  helping  with  scalability  and  robustness,  aggregating  sensor  nodes 
into  clusters  has  additional  benefits: 

1.  conserving  radio  resources  such  as  bandwidth; 

2.  promoting  spatial  code  reuse  and  frequency  reuse; 

3.  simplifying  the  topology,  e.g.,  when  a  mobile  radar  changes  its  location,  it  is  sufficient  for  the 
nodes  in  the  attended  clusters  to  update  their  topology  information; 

4.  reducing  the  generation  and  propagation  of  routing  information;  and, 

5.  concealing  the  details  of  global  network  topology  from  individual  nodes. 

In  RSN,  each  radar  can  provide  its  waveform  parameters  such  as  S{  to  its  clusterhead  radar,  and 
the  clusterhead  radar  can  combine  the  waveforms  from  its  cluster  members. 

In  RSN  with  M  radars,  the  received  signal  for  clusterhead  (assume  it’s  radar  1)  is 

M 

n (u,t)  =  oi(u)xi(t  -  ti)exp(j2nFDit)  +  n(u,t)  (22) 

i= 1 

where  a(u )  stands  for  radar  cross  section  (RCS),  which  can  be  modeled  using  non-zero  constants 
for  11011-fluctuating  targets  and  four  Swerling  target  models  for  fluctuating  targets  [18];  Fd{  is  the 
Doppler  shift  of  the  target  relative  to  waveform  i;  ti  is  the  delay  of  waveform  i,  and  n(u,  t)  is  the 
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additive  white  Gaussian  noise  (AWGN).  In  this  paper,  we  propose  a  RAKE  structure  for  waveform 
diversity  combining,  as  illustrated  by  Fig.  1.  This  figure  summarizes  how  the  clusterhead  works. 
The  received  signal  ri(u,t)  consists  of  echos  triggered  by  the  waveforms  from  each  radar  sensor, 
and  x*(t  —  ti )  is  used  to  retrieve  the  amplified  waveform  from  radar  i  (amplified  by  the  target 
RCS)  based  on  the  orthogonal  property  presented  in  Sections  2  and  3,  and  then  this  information 
is  time-averaged  for  diversity  combining. 

According  to  this  structure,  the  received  ri(u,t)  is  processed  by  a  bank  of  matched  filters,  then 
the  output  of  branch  1  (after  integration)  is 


Z\(u]t\,- •  ■  ,tM,Fox ,•••  ,Fdm ) 

rT/ 2 


rl /\i 

=  /  r\(u,t)xl(t  —  ti)ds 

J-T/2 

/T/  2  M 

[V'aj(u)xj(f  -  ti)exp(j2nFDit)  +  n(u,t)]x\(t  -  t{)dt 

-772  ~1 


Assuming  t\  =  £2  =  •  •  •  =  t\j  =  r,  then  based  on  (18), 

M 

Zi(u;t,FDi,---  ,FDm)  y^q(u)E’sinc[7r(i  -  1  +  FDiT)\ 

t=2 

Q(u)Esin[7rFDl(T- |r|)]  ,  _N 
+  - - +n(“'r) 

Similarly,  we  can  get  the  output  for  any  branch  m  (m  =  1, 2,  •  •  •  ,  M), 

M 

Zm{u]T,  ,Fqm)  «  ^2  a(u)Esmc[n(i  -  m  + FDiT)] 

a{u)E sin[n FDm(T  -  |r|)] 

+  - wz - +n{u’T) 


(23) 

(24) 


(25) 


(26) 


Therefore  Zm(u;  r,  Fpl ,  •  •  •  ,  Fdm)  consists  of  three  parts,  namely  signal  (reflected  signal  from  radar 
m  waveform):  d^(T-1t|)]  ^  jnterferences  from  other  waveforms:  YliL\  i^m  a(u)^sinc[7r(i  “ 

m  +  Fo{T)],  and  noise:  n(u,r). 

We  can  also  have  the  following  three  special  cases  for  | Zm(u;  r,  Fqx  ,  •  •  •  ,  Fqm) |: 
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1.  When  FDl  =  ■■■  =  FDm  =  0, 


Zm(u;T,  0,  (),•••  ,0) 


Ea(u)(T  -  \t\)} 


+  n(u,  t) 


(27) 


which  means  that  if  there  is  no  Doppler  mismatch,  there  is  no  interference  from  other  wave¬ 
forms. 


2.  If  r  =  0,  (26)  becomes 


Zm{u',0, Fpx ,  •  •  •  ,Ff)M) 


M 

^2  tt(u)£sinc[7r(i  -  m  +  FDiT)\  +  +  n(u 


(28) 


i=l, 


3.  If  r  =  0,  and  Fdx  =■••■  =  Fdm  =  0,  (26)  becomes 


Zm(u\  0, 0, 0,  •  •  •  ,  0)  «  Ea(u)  +  n(u)  (29) 

Doppler  mismatch  happens  quite  often  in  target  search  where  target  velocity  is  not  yet  known. 
However,  in  target  recognition,  generally  high-resolution  measurements  of  targets  in  range 
(r  =  0)  and  Doppler  are  available,  therefore  (29)  will  be  used  for  CATR. 

How  to  combine  all  the  Zm's  (m  =  1,2,  ••  •  ,  M)  is  very  similar  to  the  diversity  combining  in 
wireless  communications  to  combat  channel  fading,  and  the  combination  schemes  may  be  different 
for  different  applications.  In  this  paper,  we  are  interested  in  applying  the  RSN  waveform  diversity 
to  CATR,  e.g.,  recognition  that  the  echo  on  a  radar  display  is  that  of  an  aircraft,  ship,  motor 
vehicle,  bird,  person,  rain,  chaff,  clear-air  turbulence,  land  clutter,  sea  clutter,  bare  mountains, 
forested  areas,  meteors,  aurora,  ionized  media,  or  other  natural  phenomena  via  collaborations 
among  different  radars.  Early  radars  were  “blob”  detectors  in  that  they  detected  the  presence  of  a 
target  and  gave  its  location  in  range  and  angle,  and  radar  began  to  be  more  than  a  blob  detector  and 
could  provide  recognition  of  one  type  of  target  from  another  [21].  It  is  known  that  small  changes  in 
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the  aspect  angle  of  complex  (multiple  scatter)  targets  can  cause  major  changes  in  the  radar  cross 
section  (RCS).  This  has  been  considered  in  the  past  as  a  means  of  target  recognition,  and  is  called 
fluctuation  of  radar  cross  section  with  aspect  angle ,  but  it  has  not  had  much  success  [21].  In  [19], 
a  parametric  filtering  approach  was  proposed  for  target  detection  using  airborne  radar.  In  [14], 
knowledge-based  sensor  networks  were  applied  to  threat  assessment.  In  this  paper,  we  propose 
maximum  likelihood  collaborative  automatic  target  recognition  (ML-CATR)  algorithms  for  RSN. 
We  study  non-fluctuating  targets  as  well  as  fluctuating  targets. 


4.1  ML-CATR  for  Non-fluctuating  Targets 


In  some  sources,  the  non-fluctuating  target  is  identified  as  a  uSwerling  0”  or  “Swerling  5”  model  [22]. 
For  non-fluctuating  targets,  the  RCS  am(u)  is  just  a  constant  a  for  a  given  target.  In  (29), 
n(u,r)  is  a  zero-mean  Gaussian  random  variable  for  a  given  r,  so  |Zm(u;  0, 0,  •  •  •  ,0)|  follows  a 
Rician  distribution  because  signal  Ea(u)  is  a  positive  constant  Ea  for  non-fluctuating  targets.  Let 
yrn  =  |Zm(u;0,0,  •  •  •  ,0)|,  then  the  probability  density  function  (pdf)  of  ym  is 


f(  \  tym  r  (2/771  "b  ^2)ir 
f(Vm)  =  ~^2~  exP[ - - ]/o(-^-) 


(30) 


where 

A  =  Ea,  (31) 

a1  is  the  noise  power  (with  I  and  Q  sub-channel  power  cr2/2),  and  /q(-)  is  the  zero-order  modified 

Bessel  function  of  the  first  kind.  Let  y  =  [2/1 , 2/2 7  *  *  *  ,2/m],  then  the  pdf  of  y  is 

M 

/( y)  =  n  (32) 

771=1 

Our  CATR  is  a  multiple-category  hypothesis  testing  problem,  i.e.,  to  decide  a  target  category 
(e.g.  aircraft,  ship,  motor  vehicle,  bird,  etc.)  based  on  r\(u,t).  Assume  there  are  totally  N 
categories  and  a  category  n  target  has  RCS  an,  therefore  the  ML-CATR  algorithm  to  decide  a 
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target  category  C  can  be  expressed  as, 

C  =  argmax^=1/(y|A  =  Ean)  (33) 

M  c, 

N  TT 

=  arg  maxn=1  [[  —  ex 

m= 1 

4.2  ML-CATR  for  Fluctuating  Targets 

Fluctuating  target  modeling  is  more  realistic  in  which  the  target  RCS  is  drawn  from  either  the 
Rayleigh  or  chi-square  of  degree  four  pdf.  The  Rayleigh  model  describes  the  behavior  of  a  complex 
target  consisting  of  many  scatters,  none  of  which  is  dominant.  The  fourth-degree  chi-square  models 
targets  having  many  scatters  of  similar  strength  with  one  dominant  scatter.  Based  on  different 
combinations  of  pdf  and  decorrelation  characteristics  (scan-to-scan  or  pulse- to-pulse  decorrelation), 
four  Swerling  models  are  used  [18].  In  this  paper,  we  focus  on  the  “Swerling  2”  model  which  is 
a  Rayleigh  distribution  with  pulse-to-pulse  decorrelation.  The  pulse-to-pulse  decorrelation  implies 
that  each  individual  pulse  results  in  an  independent  value  for  RCS  a. 

For  the  Swerling  2  model,  the  RCS  |a(u)|  follows  a  Rayleigh  distribution  and  its  I  and  Q 
subchannels  follow  zero- mean  Gaussian  distributions  with  a  variance  72.  Assume 


p[-(^  +  f2Q")]/o(2£aoym) 


(34) 


a(u)  =  a  j(u)  +  jaQ(  u)  (35) 

and  n(u)  =  nj(u)  +  jriQ{u)  follows  a  zero- mean  complex  Gaussian  distribution  with  a  variance 
cr2  for  the  I  and  Q  subchannels.  Therefore  according  to  (29),  Zm(u;  0,0,0,  •••  ,0)  is  a  zero-mean 
Gaussian  random  variable  with  a  variance  E2 72  +  a2  for  the  I  and  Q  subchannels,  which  means 
ym  =  \Z,n(u;  0. 0,  •  ■  •  ,  0) |  follows  a  Rayleigh  distribution  with  a  parameter  \J E2r)'2  +  a2, 

f(.ym)  =  £272  +  a2  exP(“  e2i™+  <t2  ^  (36) 

The  mean  value  of  ym  is  y  n^E  72  \  and  the  variance  is  ^4-  n^E2  7  ^ .  The  variance  of  signal  is 

2^  ^  and  the  variance  of  noise  is  , 
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Let  y  =  [j/i,2/2,  •  •  •  ,  Vm],  then  the  pdf  of  y  is 

M 

/( y)  =  n  tty™)  (3?) 

m=l 

Assume  there  are  totally  N  categories  and  a  category  n  target  has  a  RCS  an(u)  (with  a  variance 
7^),  so  the  ML-ATR  algorithm  to  decide  a  target  category  C  can  be  expressed  as, 


C 


argmax^=1/(y|7  =  7n) 
M 

Vm 


N  TT  fi/m  ,  Vra 

arg maxn=1  +  e*P(- ^  +  a2 , 


(38) 

(39) 


5  Simulations  and  Real  World  Application  Example 

5.1  Computer  Simulations 

Radar  sensor  networks  will  be  required  to  detect  a  broad  range  of  target  classes.  Too  often,  the 
characteristics  of  objects  that  are  not  of  interest  (e.g.,  bird)  are  similar  to  those  of  threat  objects 
(e.g.,  missile).  Therefore,  new  techniques  to  discriminate  threat  against  undesired  detections  (e.g. 
birds,  etc.)  are  needed.  We  applied  our  ML-CATR  to  this  important  application,  to  recognize  a 
target  from  many  target  classes.  We  assume  that  the  domain  of  target  classes  is  known  a  priori  (N 
in  Sections  4.1  and  4.2),  and  that  the  RSN  is  confined  to  work  only  on  the  known  domain. 

For  non-fluctuating  target  recognition,  our  targets  have  5  classes  with  different  RCS  values, 
which  are  summarized  in  Table  1  [21].  We  applied  the  ML-CATR  algorithms  in  Section  4.1  (for 
the  non-fluctuating  target  case)  to  classify  an  unknown  target  as  one  of  these  5  target  classes. 
At  each  average  SNR  value,  we  ran  Monte-Carlo  simulations  for  105  times  for  each  target.  The 
average  SNR  value  is  based  on  the  average  power  from  all  targets  (signal  variance),  so  the  actual 
SNRs  for  bird  and  missile  are  much  lower  than  the  average  SNR  value.  For  example,  at  the 
average  SNR=16dB,  the  bird  target  SNR=-33.1646dB,  and  the  missile  target  SNR=0.8149dB;  and 


14 


at  average  SNR=20dB,  the  bird  target  SNR=-29.1646dB,  and  the  missile  target  SNR=4.8149dB. 
In  Fig.  2(a)(b),  we  plotted  the  probability  of  the  ATR  error  in  bird  and  missile  recognition  when 
they  are  assumed  as  non-fluctuating  targets.  These  figures  indicate  that  a  single  radar  system  can’t 
perform  well  in  both  recognitions,  whose  probability  of  the  ATR  error  is  above  10%,  which  can’t 
be  used  for  real-world  ATR.  However,  the  5-radar  RSN  and  10-radar  RSN  can  maintain  very  low 
ATR  errors.  In  Fig.  2(c),  we  plotted  the  average  probability  of  the  ATR  error  for  all  5  targets 
recognition.  Since  the  other  3  targets  (different  aircrafts)  have  much  higher  SNRs,  their  ATR  error 
is  lower,  which  makes  the  average  probability  of  ATR  error  lower. 

For  fluctuating  target  recognition,  we  assume  the  fluctuating  targets  follow  the  “Swerling  2” 
model  (Rayleigh  distribution  with  pulse-to-pulse  decorrelation),  and  assume  the  RCS  value  listed  in 
Table  1  to  be  the  standard  deviation  (std)  7n  of  RCS  an(u)  for  target  n.  We  applied  the  ML-CATR 
algorithm  in  Section  4.2  (for  the  fluctuating  target  case)  for  target  recognition  within  the  5  targets 
domain.  Similarly  we  ran  Monte-Carlo  simulations  at  each  SNR  value.  In  Fig.  3(a) (b)(c),  we 
plotted  the  ATR  performance  for  fluctuating  targets  and  compared  the  performances  of  a  single¬ 
radar  system,  a  5-radar  RSN,  and  a  10-radar  RSN.  Observe  that  the  two  RSNs  perform  much  better 
than  the  single  radar  system.  The  ATR  error  for  the  missile  is  higher  than  that  of  bird  because 
the  Rayleigh  distribution  of  the  missile  has  a  lot  of  overlap  with  its  neighbor  targets  (aircrafts). 
Comparing  Fig.  2(a)(b)(c)  to  Fig.  3(a) (b)(c),  it  is  clear  that  higher  SNRs  are  needed  for  the 
fluctuating  target  recognition  comparing  to  the  non-fluctuating  target  recognition.  According  to 
Skolnik  [21],  the  radar  performance  with  a  probability  of  recognition  error  (pe)  less  than  10%  is 
good  enough.  Our  RSN  with  waveform-diversity  can  achieve  a  probability  of  ATR  error  much  less 
than  10%  for  each  target  ATR  as  well  as  the  average  ATR  for  all  targets.  However,  the  single 
radar  system  has  a  probability  of  ATR  error  much  higher  than  10%.  Fig.  3(c)  also  tells  us  that  the 
average  probability  of  ATR  error  of  a  single-radar  system  is  impossible  to  be  less  than  10%  even  at 
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an  extremely  high  SNR.  Our  RSN  with  waveform  diversity  is  very  promising  for  real-world  ATR. 


5.2  Real  World  Application  Example 

We  verified  our  approach  based  on  a  real  world  application  example,  sense-through-foliage  target 
detection  from  U.S.  Air  Force  Research  Laboratory.  The  target  is  a  trihedral  reflector  (as  shown 
in  Fig.  4)  in  a  forest,  we  plot  two  collections  using  UWB  radars  in  Figs.  5a  and  5b.  Fig.  5a  has 
no  target  on  range,  and  Fig.  5b  has  target  at  samples  around  13,900.  We  plot  the  echo  differences 
between  Figs.  5a  and  5b  in  Fig.  5c.  However,  it  is  impossible  to  identify  whether  there  is  any 
target  and  where  there  is  target  based  on  Fig.  5c,  which  means  single  radar  doesn’t  work  even 
in  ideal  case.  Since  significant  pulse-to-pulse  variability  exists  in  the  echos,  this  motivate  us  to 
explore  the  spatial  and  time  diversity  using  radar  sensor  networks  approach.  The  echos,  i.e. ,  RF 
responses  by  the  pulse  of  each  cluster-member  radar,  are  combined  by  the  clusterhead  using  the 
RAKE  structure  in  Fig.  1. 

We  ran  simulations  for  an  RSN  with  30  radars,  and  plot  the  power  of  AC  values  in  Figs.  6a 
and  6b  for  the  two  cases  (with  target  and  without  target)  respectively.  Observe  that  in  Fig.  6b, 
the  power  of  AC  values  (around  sample  13,900)  where  the  target  is  located  is  non-fluctuating 
(monotonically  increase  then  decrease).  Although  some  other  samples  also  have  very  high  AC 
power  values,  it  is  very  clear  that  they  are  quite  fluctuating  and  the  power  of  AC  values  behaves 
like  random  noise  because  generally  the  clutter  has  Gaussian  distribution  in  the  frequency  domain. 

6  Conclusions  and  Future  Works 

We  have  studied  the  constant  frequency  pulse  waveform  design  and  diversity  in  radar  sensor  net¬ 
works.  We  showed  that  the  waveforms  can  coexist  if  the  carrier  frequency  spacing  is  a  multiple 
of  1/T  between  two  radar  waveforms.  We  made  analysis  on  interferences  among  waveforms  in 
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RSN  and  proposed  a  RAKE  structure  for  waveform  diversity  combining  in  RSN.  As  an  application 
example,  we  applied  the  waveform  design  and  diversity  to  CATR  in  RSN  and  proposed  ML-CATR 
algorithms  for  non-fluctuating  targets  as  well  as  fluctuating  targets.  Simulation  results  show  that 
an  RSN  using  our  waveform  diversity-based  ML-ATR  algorithms  performs  much  better  than  a  sin¬ 
gle  radar  system  for  non-fluctuating  targets  and  fluctuating  targets  recognition.  We  also  validated 
our  RSN  approach  via  a  real-world  sense-through-foliage  application  example. 

In  our  future  research,  we  will  investigate  the  CATR  when  multiple  targets  coexist  in  RSN,  and 
the  number  of  targets  are  time-varying.  In  this  paper,  we  used  spatial  diversity  combining.  For 
multi-target  ATR,  we  will  further  investigate  spatial-temporal-frequency  combining  for  waveform 
diversity  in  RSN. 
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Table  1:  RCS  values  at  microwave  frequency  for  5  targets. 


Index  n 

Target 

RCS 

i 

Bird 

0.01 

2 

Conventional  unmanned  winged  missile 

0.5 

3 

Small  single-engine  aircraft 

1 

4 

Small  fighter  aircraft  or  4  passenger  jet 

2 

5 

Large  fighter  aircraft 

6 
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Figure  1:  Waveform  diversity  combining  by  clusterhead  in  RSN. 
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(a) 


(b) 


(c) 


Figure  2:  Probability  of  ATR  error  for  non-fluctuating  targets  at  different  average  SNR  (dB)  values, 
(a)  bird,  (b)  missile,  (c)  the  average  probability  of  ATR  error  for  5  targets. 
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(a) 


(c) 


Figure  3:  Probability  of  ATR  error  for  fluctuating  targets  at  different  average  SNR  (dB)  values, 
(a)  bird,  (b)  missile,  (c)  the  average  probability  of  ATR  error  for  5  targets. 


24 


Figure  4:  The  target  (a  trihedral  reflector)  is  shown  on  the  stand  at  300  feet  from  the  measurement 
lift. 


25 


(b) 


(c) 

Figure  5:  Measurement  of  UWB  radar,  (a)  Expanded  view  of  traces  (no  target)  from  sample 
13,001  to  15,000.  (b)  Expanded  view  of  traces  (with  target)  from  sample  13,001  to  15,000.  (c)  The 
differences  between  (a)  and  (b). 
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(a) 


Figure  6:  Power  of  AC  values  based  on  radar  sensor  networks  approach,  (a)  No  target  (b)  With 
target  in  the  field  (in  samples  around  13,900). 
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Abstract — In  this  paper,  we  study  foliage  clutter  modeling 
on  a  basis  of  both  pragmatically  narrowband  and  ultra-wide 
band  (UWB)  radars.  We  propose  that  the  foliage  clutter  follows 
log-logistic  model  using  maximum  likelihood  (ML)  parameter 
estimation  as  well  as  the  root  mean  square  error  (RMSE)  on 
PDF  curves  between  original  clutter  and  statistical  model  data.  In 
addition  to  investigating  the  log-logistic  model,  we  also  compare 
it  with  other  popular  clutter  models,  namely  log-normal  and 
Weibull.  We  show  that  the  log-logistic  model  not  only  achieves 
the  smallest  standard  deviation  (STD)  error  on  estimated  model 
parameters,  but  also  has  the  best  goodness-of-fit  and  smallest 
RMSE. 

I.  Introduction  and  Motivation 

Clutter  is  a  term  used  to  define  all  unwanted  echoes  from 
natural  environment  [1].  The  nature  of  clutter  may  necessarily 
vary  on  a  basis  of  different  applications  and  radar  parameters. 
Most  previous  studies  have  investigated  land  clutter  and  sea 
clutter.  For  example,  log-normal  and  Weibull  distributions 
have  been  proven  to  be  better  suited  for  the  clutter  than 
Rayleigh  and  Rician  models  in  high  resolution  radar  systems. 
As  far  as  clutter  modeling  in  forest  is  concerned,  it  is  still 
of  great  interest  and  is  likely  to  take  some  time  to  reach  any 
agreement.  A  team  of  researchers  from  MIT  [2]  and  U.  S. 
Army  Research  Laboratory  (ARL)  [3]  [4]  have  measured  ultra- 
wideband  (UWB)  backscatter  signals  in  foliage  for  different 
polarizations  and  frequency  ranges.  The  measurements  show 
that  the  foliage  clutter  is  impulsively  corrupted  with  multi- 
path  fading,  which  leads  to  inaccuracy  of  the  K-distributions 
description  [5].  The  Air  Force  Office  of  Scientific  Research 
(AFOSR)  has  conducted  field  measurement  experiment  con¬ 
cerning  foliage  penetration  radar  since  2004  and  noted  that 
metallic  targets  may  be  more  easily  identified  with  wideband 
than  with  narrowband  signals  [6]. 

In  this  investigation,  we  will  apply  both  narrowband  and 
ultra-wide  band  (UWB)  radar  to  model  the  foliage  clutter, 
as  we  believe  that  foliage  clutter  is  composed  of  intervening 
materials  that  are  electromagnetically  dispersive,  which  con¬ 
tributes  to  the  strong  frequency  dependence  of  foliage,  and 
thus  a  narrowband-wideband  study  would  assist  with  the  better 
understanding  of  statistic  property  of  the  clutter. 

Narrowband  signals  have  been  tried  at  200,  400  and  600 
megahertz  respectively,  while  UWB  radar  emissions  are  at  a 
relatively  low  frequency-typically  between  100  MHz  and  3 
GHz.  Each  frequency  component  in  a  radar  signal  will  sense 


the  foliage  clutter  in  a  slightly  different  manner,  therefore 
provide  differences  in  multipath. 

In  our  present  work,  we  investigate  the  use  of  the  log- 
logistic  distribution  (LLD)  to  model  foliage  clutter  and  il¬ 
lustrate  the  goodness-of-fit  to  real  UWB  clutter  data.  Addi¬ 
tionally,  we  compare  the  goodness-of-fit  with  existing  popular 
models  namely  log-normal,  Weibull  by  means  of  maximum 
likelihood  estimation  (MLE)  and  the  root  mean  square  error 
(RMSE).  The  result  shows  that  log-logistic  model  provides  the 
best  fit  to  the  foliage  clutter  for  both  narrowband  and  UWB 
signals. 

The  rest  of  this  paper  is  organized  as  follows.  In  Section 
II  we  discuss  the  properties  and  applicability  of  log-logistic 
as  a  statistical  model  for  foliage  clutter.  The  measurement 
and  collection  of  clutter  data  we  used  in  this  paper  are 
summarized  in  Section  III.  Section  IV  discribes  the  estimation 
of  model  parameters  and  the  goodness-of-fit.  Finally,  section 
V  concludes  this  paper  and  describes  some  future  work. 

II.  Log-logistic  Model 

Log-logistic  has  been  applied  recently  in  hydrological 
analysis.  In  spite  of  its  intensive  application  in  precipitation 
and  stream-flow  data,  the  LLD  [7]  statistical  model,  to  the  best 
of  our  knowledge,  has  never  been  applied  to  radar  foliage 
clutter.  The  motivation  for  considering  log-logistic  model  is 
based  on  its  higher  kurtosis  and  longer  tails,  as  well  as  its 
shape  similarity  to  log-ndrmal  and  Weibull  distributions.  Thus 
it  is  intended  to  be  employed  to  estimate  how  well  the  model 
matches  our  collected  foliage  clutter  statistics. 

The  PDF  for  LLD  distribution  is  given  by 

lna-n 

/(*)  = -  ,  *  >  0,  a  >  0  (1) 

ax(  1  +  e  47  )2 

where  is  scale  parameter  and  a  is  shape  parameter. 

The  mean  of  the  the  LLD  is 

E{x}  =  eMT(l  4-  cr)r(l  -  a)  (2) 

whereas  the  median  is 

M{x}  —  (3) 

The  variance  is  given  by 

Var{x}  =  e2/1{r(l  +  2a)r(l  -  2a)  -  [r(l  +  <r)r(l  -  a)]2} 

(4) 


while  the  moment  of  order  k  is 

E{xk }  =  ae^B(ka,  1  —  k  <  —  (5) 

a 

where 

B(m,n)  =  [  xm~l(\  —  x)n~1dx  (6) 

Jo 

The  PDFs  for  selected  /i’s  and  cr’s  for  the  LLD  distribution 
are  illustrated  in  Fig.  1.  We  will  apply  LLD  along  with 
lognormal  and  Weibull  models  to  analyze  the  clutter  data. 


X 


Fig.  1.  Log-logistic  distribution  PDF  for  /x  =  0.5  and  cr  =  0.5,  /x  =  0.5 
and  er  =  1,  /x  =  2  and  a  —  0.5,  /x  =  2  and  <7  =  1 

III.  Experiment  Setup  and  Data  Collection 

The  foliage  penetration  measurement  effort  began  in  August 
2005  and  continued  through  December  2005.  Working  in 
August  through  the  fall  of  2005,  the  foliage  measured  included 
late  summer  foliage  and  fall  and  early  winter  foliage.  Late 
summer  foliage,  because  of  the  limited  rainfall,  involved 
foliage  with  decreased  water  content.  Late  fall  and  winter 
measurements  involved  largely  defoliated  but  dense  forest. 


Fig.  2.  This  figure  shows  the  lift  with  the  experiment.  The  antennas  are  at 
the  far  end  of  the  lift  from  the  viewer  under  the  roof  that  was  built  to  shield 
the  equipment  from  the  elements.  This  picture  was  taken  in  September  with 
the  foliage  largely  still  present.  The  cables  coming  from  the  lift  are  a  ground 
cable  to  an  earth  ground  and  one  of  4  tethers  used  in  windy  conditions. 

The  radar  experiment  was  constructed  on  a  seven-ton  man 
lift,  which  had  a  total  lifting  capacity  of  450  kg.  The  limit 
of  the  lifting  capacity  was  reached  during  the  experiment  as 
essentially  the  entire  measuring  apparatus  was  placed  on  the 
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Fig.  3.  Signal  at  200  megahertz  (a)  transmitted  pulse  (b)  received  pulse 
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Fig.  4.  Signal  at  400  megahertz  (a)  transmitted  pulse  (b)  received  pulse 
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Fig.  5.  Signal  at  600  megahertz  (a)  transmitted  pulse  (b)  received  pulse 
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Fig.  6.  Signal  at  600  megahertz  (a)  transmitted  pulse  (b)  received  pulse 


lift  (as  shown  in  Fig.  2).  The  principle  pieces  of  equipment 
secured  on  the  lift  are:  dual  antenna  mounting  stand;  two 
antennas;  200MHz,  lKw  Amplifier,  power  supply,  pre-amp; 
400MHz,  lKw  Amplifier,  power  supply,  pre-amp;  600MHz, 
lKw  Amplifier,  power  supply,  pre-amp;  Tektronix  model  7704 
B  oscilloscope;  rack  system;  IBM  laptop;  HP  signal  Generator; 
Custom  RF  switch  and  power  supply  and  weather  shield  (small 
hut).  Particularly  for  UWB  signal,  a  Barth  pulse  source  (Barth 
Electronics,  Inc.  model  732  GL)  was  used.  The  pulse  generator 
uses  a  coaxial  reed  switch  to  discharge  a  charge  line  for  a  very 
fast  rise  time  pulse  outputs.  The  model  732  pulse  generator 
provides  pulses  of  less  than  50  picoseconds  (ps)  rise  time, 
with  amplitude  from  150  V  to  greater  than  2  KV  into  any 
load  impedance  through  a  50  ohm  coaxial  line.  The  generator 
is  capable  of  producing  pulses  with  a  minimum  width  of  750 
ps  and  a  maximum  of  1  microsecond.  This  output  pulse  width 
is  determined  by  charge  line  length  for  rectangular  pulses,  or 
by  capacitors  for  1/e  decay  pulses. 

The  system  was  pointing  at  the  specified  250  feet  one  way 
distance.  For  the  data  we  used  in  this  paper,  each  sample  is 
spaced  at  50  picoseconds  interval,  and  16,000  samples  were 
collected  for  each  collection  for  a  total  time  duration  of  0.8 
microseconds.  35  pulses  reflected  clutter  signal  were  obtained 
for  each  collection  at  the  same  frequency  and  the  same  site, 
but  different  time. 

The  transmitted  pulses  and  received  signal  are  shown  in 
Fig.  3~  6.  We  averaged  the  35  pulses  in  order  to  remove 
the  random  noise  from  the  clutter.  It  is  obvious  to  see  the 
presence  of  multiple  scattering  from  received  signals,  UWB 
in  particular. 

IV.  Statistical  Analysis  of  the  Foliage  Clutter 
Data 

A.  Maximum  Likelihood  Estimation 

Using  the  collected  clutter  data  mentioned  above,  we  apply 
Maximum  Likelihood  Estimation  (MLE)  approach  to  estimate 
the  parameters  of  the  log-logistic,  log-normal  and  Weibull 
models.  MLE  is  often  used  when  the  sample  data  are  known 
and  parameters  of  the  underlying  probability  distribution  are 
to  be  estimated  [8]  [9].  It  is  generalized  as  follows: 

Let  yi,  j/2»  •  ■  • »  2/tv  be  N  independent  samples  drawn  from  a 
random  variable  Y  with  m  parameters  02,  •  *  • ,  0m>  where 
6i  £  6 ,  then  the  likelihood  function  expressed  as  a  function  of 
6  conditional  on  Y  is 


N 

Ln{ Y|0)  =  Y[  fy\e(yk\0u02i  *  •  •  i  ^m)  (7) 

k= 1 

The  maximum  likelihood  estimate  of  02,  •  •  • ,  0m  is  the  set 
of  values  6\,  02,  •  •  • ,  0m  that  maximize  the  likelihood  function 
LN(Y\0). 

As  the  logarithmic  function  is  monotonically  increas¬ 
ing,  maximizing  L^{Y\6)  is  equivalent  to  maximizing 
ln(Ljv(Y|0)).  Hence,  it  can  be  shown  that  a  necessary  but  not 
sufficient  condition  to  obtain  the  ML  estimate  6  is  to  solve  the 
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Fig.  7.  Model  Comparison  (a)  200  megahertz  (b)  400  megahertz  (c)  600 
megahertz  (d)  UWB 


TABLE  I 

Estimated  Parameters  and  STD  Error 


PDF 

Log-Logistic 

Log-normal 

Weibull 

200MHz 

A  =  6.95521 
a  =  0.619019 
e p  =  0.00848 
=  0.00409 

A  =  6.95754 
<7  =  1.12545 
=  0.00890 
£*  =  0.00629 

a  =  1816.39 

b  =  0.95900 
£a  =  15.8698 
£b  =  0.00560 

400MHz 

A  =  7.138 
<7  =  0.67077 
e p  =  0.00922 
£*  =  0.00441 

A  =  7.03467 
<7  =  1.22477 
£^  =  0.00968 
£a  =  0.00685 

a  =  1970.62 

6=  1.06597 
£a  =  15.38150 
£b  =  0.00671 

600MHz 

A  =  6.26835 
a  =  0.98842 
=  0.01334 
£*  =  0.00637 

A  =  6.24074 
<7  =  1.68841 
£»  =  0.01383 
£a  =  0.00944 

a  =  1164.35 

b  =  0.67637 
£a  =  14.3965 
£b  =  0.00414 

UWB 

A  =  7.76868 
a  =  0.78651 
=  0.01078 
£<r  =  0.00522 

A  =  7.79566 
<7  =  1.41771 
£^  =  0.01121 
£<r  =  0.00793 

a  =  4901.07 

6  =  0.74322 
£a  =  55.3011 
eb  =  0.00434 

likelihood  equation 

^ln(LN(Y|0))=O  (8) 

Using  the  collected  clutter  radar  mentioned  above,  we  apply 
MLE  to  obtain  jl  and  a  for  log-logistic,  jl  and  a  for  the  log- 
normal,  a  and  b  for  the  Weibull.  The  results  are  shown  in 
Table  I.  We  also  explore  the  standard  deviation  (STD)  error 
of  each  parameter.  These  descriptions  are  also  shown  in  table 
I  in  the  form  of  exi  where  x  denotes  different  parameter  for 
each  model. 

From  Table  I,  we  can  see  that  no  matter  what  frequency 
is  used,  in  general  STD  error  for  log-logistic  parameters  are 
smaller  than  those  of  log-normal  and  Weibull. 

B.  Goodness-of-fit  in  curve  and  RMSE 

We  may  also  observe  the  extend  to  which  the  PDF  curve 
of  the  statistic  model  matches  that  of  clutter  data  by  intuitive 
visual  inspection  as  well  as  calculating  the  root  mean  square 
error  (RMSE). 

The  goodness-of-fit  in  curve  are  illustrated  in  Fig.  7.  The 
histograms  of  original  radar  clutter  data  are  skewed  to  the 
right.  In  general,  LLD  tends  to  have  lower  kurtosis,  steeper 
slope  than  those  of  lognormal  and  Weibull  but  lower  PDF  in 
tail  than  that  of  lognormal.  At  200MHz,  LLD  and  lognormal 
are  superior  to  Weibull.  At  400MHz,  the  performance  of 
lognormal  is  the  worst.  At  600MHz,  Weibull  become  the  worst 
again  and  as  for  UWB  signal,  Weibull  can  not  fit  well. 

To  further  evaluate  the  goodness-of-fit,  the  RMSE  is  ap¬ 
plied.  Let  i  (i=l,2,  •••  ,  n)  be  the  sample  index  of  clutter 
amplitude,  c»  is  the  corresponding  PDF  value  whereas  c*  is  the 
PDF  value  of  the  statistical  model  with  estimated  parameters 
by  means  of  MLE  as  mentioned  above.  The  RMSE  is  obtained 
through  the  following  equation: 


RMSE  = 


\ 


t=i 


(9) 


Here  we  apply  n=  101  for  each  model.  The  result  is  given  in 
Table  II. 


TABLE  II 

Root  Mean  Square  Error  (RMSE) 


PDF 

Log-Logistic 

Log-normal 

Weibull 

200MHz 

2.157  x  ltrs 

2.3248  x  Kr5 

3.5849  x  10"s 

400MHz 

2.2423  x  10~b 

2.8956  x  10"* 

2.6764  x  10-a 

600MHz 

3.3864  x  10~b 

3.4278  x  10-s 

3.7259  x  10-s 

UWB 

2.739  x  10~b 

3.1866  x  10~b 

T4045 

This  shows  that  despite  of  visual  inspection,  the  log-logistic 
model  is  more  accurate  than  the  other  two  models. 

V.  Conclusion 

On  a  basis  of  foliage  clutter  data  measured  by  the  both 
narrowband  (200MHz,  400MHz  and  600MHz)  and  UWB 
radars,  we  show  that  it  is  more  accurate  to  describe  foliage 
clutter  using  log-logistic  statistic  model  rather  than  log-normal, 
Weibull.  Future  research  will  investigate  the  characteristics 
of  target  to  better  achieve  the  target  detection,  tracking  and 
imaging. 
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Abstract 


In  this  paper,  we  propose  that  foliage  clutter  follows  log-logistic  model  using  maximum 
likelihood  (ML)  parameter  estimation  as  well  as  the  root  mean  square  error  (RMSE)  on 
PDF  curves  between  original  clutter  and  statistical  model  data.  We  not  only  investigate 
log-logistic  model,  but  compare  it  with  other  popular  clutter  models,  namely  log-normal, 
Weibull  and  Nakagami.  It  shows  that  the  log-logistic  model  not  only  achieves  the  small¬ 
est  standard  divination  (STD)  error  on  estimated  model  parameters,  but  also  the  best 
goodness-of-fit  and  smallest  RMSE  for  both  poor  and  good  clutter  signals. 

Index  Terms  :  foliage  clutter,  log-logistic,  log-normal,  Weibull,  Nakagami,  goodness-of-fit 

1  Introduction  and  Motivation 

Detection  and  identification  of  military  equipment  in  a  strong  clutter  background,  such  as 
foliage,  soil  cover  or  building  has  been  a  long-standing  subject  of  intensive  study.  It  is  believed 
that  solving  the  target  detection  through  foliage  environment  will  significantly  benefit  sense- 
through- wall  and  many  other  subsurface  sensing  problems.  However,  to  this  date,  the  detection 
of  foliage-covered  military  targets  with  the  required  probability  of  detection  and  false  alarm  still 
remains  a  challenging  issue.  Recent  investigations  on  environment  behavior  of  tree  canopies 
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have  shown  that  both  signal  backscattering  and  attenuation  are  significantly  influenced  by  tree 
architecture  [2].  Therefore  use  the  return  signal  from  foliage  to  establish  the  clutter  model  that 
accounts  for  environment  effects  is  crucial  for  the  sense- through- foliage  radar  detection. 

Clutter  is  a  term  used  to  define  all  unwanted  echoes  from  natural  environment  [3].  The 
nature  of  clutter  may  necessarily  vary  on  a  basis  of  different  applications  and  radar  parame¬ 
ters.  Most  previous  studies  have  investigated  land  clutter  or  sea  clutter,  and  some  conclusions 
have  been  reached.  For  example,  log-normal,  Weibull,  and  K-distributions  have  been  proven 
to  be  better  suited  for  the  clutter  description  other  than  Rayleigh  and  Rician  models  in  high 
resolution  radar  systems.  Fred  [4]  did  statistical  comparisons  and  found  that  sea  clutter  at 
low  grazing  angles  and  high  range  resolution  is  spiky  based  on  the  data  measured  from  various 
sites  in  Kauai  and  Hawaii.  David  generalized  radar  clutter  models  using  noncentral  chi-square 
density  by  allowing  the  noncentrality  parameter  to  fluctuate  according  to  the  gamma  distri¬ 
bution  [5].  Furthermore,  Henry  et  al  used  a  Neural-Network-based  approach  to  predict  sea 
clutter  model  [6]  [7]. 

However,  as  far  as  clutter  modeling  in  forest  is  concerned,  it  is  still  of  great  interest  and 
will  be  likely  to  take  some  time  to  reach  any  agreement.  A  team  of  researchers  from  MIT  [8] 
and  U.  S.  Army  Research  Laboratory  (ARL)  [9]  [10]  have  measured  ultra-wideband  (UWB) 
backscatter  signals  in  foliage  for  different  polarizations  and  frequency  ranges.  The  measure¬ 
ments  show  that  the  foliage  clutter  is  impulsively  corrupted  with  multipath  fading,  which  leads 
to  inaccuracy  of  the  K-distributions  description  [11].  The  Air  Force  Office  of  Scientific  Research 
(AFOSR)  has  conducted  field  measurement  experiment  concerning  foliage  penetration  radar 
since  2004  and  noted  that  metallic  targets  may  be  more  easily  identified  with  wideband  than 
with  narrowband  signals  [12]. 

In  this  investigation,  we  will  apply  ultra-wide  band  (UWB)  radar  to  model  the  foliage 
clutter.  UWB  radar  emissions  are  at  a  relatively  low  frequency-typically  between  100  MHz 
and  3  GHz.  Additionally,  the  fractional  bandwidth  of  the  signal  is  very  large  (greater  than 
0.2).  Such  a  radar  sensor  has  exceptional  range  resolution  that  also  has  an  ability  to  penetrate 
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many  common  materials  (e.g.,  walls).  Law  enforcement  personnel  have  used  UWB  ground  pen¬ 
etrating  radars  (GPRs)  for  at  least  a  decade.  Like  the  GPR,  sense-through-foliage  radar  takes 
advantage  of  UWB’s  very  fine  resolution  (time  gating)  as  well  as  low  frequency  of  operation. 

In  our  present  work,  we  investigate  the  use  of  the  log-logistic  distribution  to  model  foliage 
clutter  and  illustrate  the  goodness-of-fit  to  real  UWB  clutter  data  conducted  by  AFOSR. 
Additionally,  we  compare  the  goodness-of-fit  with  existing  popular  models  namely  log-normal, 
Weibull,  and  Nakagami  by  means  of  maximum  likelihood  estimation  (MLE)  and  the  root 
mean  square  error  (RMSE).  The  result  shows  that  log-logistic  model  provides  the  best  fit  to 
the  foliage  clutter.  Our  contribution  is  not  only  the  new  proposal  on  the  foliage  clutter  model 
with  estimated  concrete  parameters,  but  also  provide  the  criteria  and  approaches  based  on 
which  the  statistical  analysis  is  deduced.  Further,  the  theoretical  study  about  the  probability 
of  detection  as  well  as  the  probability  of  false  alarm  is  discussed. 

The  rest  of  this  paper  is  organized  as  follows.  Section  2  provides  a  statistical  model  review 
on  log-logistic,  log-normal,  Weibull  and  Nakagami  distributions  and  discuss  their  properties 
and  applicability  as  models  for  foliage  clutter.  Section  3  summarizes  the  measurement  and  the 
2  sets  of  clutter  data  that  we  used  in  this  paper.  Section  4  discusses  estimation  on  parameters 
and  the  goodness-of-fit  for  log-logistic,  log-normal,  Weibull  and  Nakagami  models  respectively. 
Section  5  analyzes  the  performance  of  radar  detection  at  presence  of  foliage  clutter.  Finally, 
section  6  concludes  this  paper  and  describes  some  future  research  topics. 

2  Clutter  Models 

Many  radar  clutter  models  have  been  proposed  in  terms  of  distinct  statistical  distributions, 
most  of  which  describe  the  characteristics  of  clutter  amplitude  or  power.  Before  detailed 
analysis,  first  we  would  like  to  discuss  the  properties  and  applicability  of  log-logistic,  log¬ 
normal,  Weibull,  and  Nakagami  statistic  distributions,  which  are  designated  as  “curve  fit” 
models  in  section  4,  since  they  are  more  likely  to  provide  good  fit  to  our  collections  of  pragmatic 
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clutter  data  in  general.  Detailed  explanations  would  be  given  in  following  subsections. 

2.1  Log-logistic  Model 

Recently  Log-logistic  model  has  been  applied  in  hydrological  analysis.  This  distribution  is  a 
special  case  of  Burr’s  type-XII  distribution  [15]  as  well  as  a  special  case  of  the  kappa  distribution 
proposed  by  Mielke  and  Jonson  [16].  Lee  et  al.  employed  the  LLD  for  frequency  analysis  of 
multiyear  drought  durations  [17],  whereas  Shoukri  et  al.  employed  LLD  to  analyse  extensive 
Canadian  precipitation  data  [18],  and  Narda  &  Malik  used  LLD  to  develop  a  model  of  root 
growth  and  water  uptake  in  wheat  [19].  In  spite  of  its  intensive  application  in  precipitation 
and  stream-flow  data,  the  log-logistic  distribution  (LLD)  [13]  statistical  model,  to  the  best  of 
our  knowledge,  has  never  been  applied  to  radar  foliage  clutter.  The  motivation  for  considering 
log-logistic  model  is  based  on  its  higher  kurtosis  and  longer  tails,  as  well  as  its  PDF  curve 
similarity  to  log-normal  and  Weibull  distributions.  It  is  intended  to  be  employed  to  estimate 
how  well  the  model  matches  our  collected  foliage  clutter  statistics. 

Here  we  apply  the  two-parameter  distribution  with  parameters  /i  and  a.  The  PDF  for  this 
distribution  is  given  by 

In  x  —  p. 

/(*)  = - - - Inx-a  ,  *>0,  <T>0  (1) 

<rx(l  +  e  "  ) 2 

where  /i  is  scale  parameter  and  a  is  shape  parameter.  The  mean  of  the  the  LLD  is 

E{x }  =  e/ir(l  4-  cr)r(l  -  a)  (2) 

The  variance  is  given  by 

Var{x}  =  e^{r(l  +  2<r)r(l  -  2a)  -  [r(l  +  a) r(l  -  a)]2}  (3) 

while  the  moment  of  order  k  is 

E{xkj  =  ae^B(k(j,  1  —  /ccr),  k  <  —  (4) 
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where 


B(m,n)=  f  xm  1(l-a;)n  ldx  (5) 

Jo 

PDFs  for  LLD  for  selected  /Ps  and  cr’s  are  illustrated  in  Fig.  1. 

2.2  Log-normal  Model 

Most  previous  experimental  data  have  resulted  in  clutter  being  modeled  using  a  log-normal 
distribution,  which  is  most  frequently  used  when  the  radar  sees  land  clutter  [20]  or  sea  clutter 
[21]  at  low  grazing  angles  (<  5  degrees)  since  log-normal  has  a  long  tail.  However,  it  has  been 
reported  that  the  log-normal  model  tends  to  overestimate  the  dynamic  range  of  the  real  clutter 
distribution  [22].  Furthermore,  most  previous  research  applies  log-normal  model  to  land  and 
sea  clutter,  but  how  accurately  it  models  foliage  clutter  requires  detailed  analysis. 

The  log-normal  distribution  [23]  is  also  a  two-parameter  distribution  with  parameters  fi 
and  a.  The  PDF  for  this  distribution  is  given  by 

^  _  (im-M)2 

/(*)  = - 7=e  2ff  ,  x>0,<7>0  (6) 

xa\Z2rc 

where  is  the  scale  parameter  and  a  is  the  shape  parameter.  The  mean,  variance  and  the 
moment  of  order  k  are  given  respectively  by 


E{x}  =  eM+<2 

(7) 

Var{x}  =  (e*2  -  l)e2»+°2 

(8) 

E{xk }  =  ek»+k2f 

(9) 

PDFs  for  selected  /Ps  and  cr’s  for  log-normal  distribution  are  shown  in  Fig.  2. 

2.3  Weibull  Model 

The  Weibull  distribution,  which  is  named  after  Waloddi  Weibull,  can  be  made  to  fit  clutter 
measurements  that  lie  between  the  Rayleigh  and  log-normal  distribution  [24].  It  has  been 
applied  to  land  clutter  [25]  [26],  sea  clutter  [27]  [28]  and  weather  clutter  [29].  However,  in 
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very  spiky  sea  and  foliage  clutter,  the  description  of  the  clutter  statistics  provided  by  Weibull 
distributions  may  not  always  be  sufficiently  accurate  [30]. 

The  Weibull  distribution  is  also  a  two-parameter  distribution  with  parameters  a  and  b. 
The  PDF  for  this  distribution  is  given  by 

f(x)  =  ba~bxb~le~^x/a>)\  £  >  0,a  >  0,  6  >  0  (10) 

where  b  is  the  shape  parameter  and  a  is  the  scale  parameter.  The  mean,  variance  and  the 
moment  of  order  k  axe  given  respectively  by 


E{x}  =  ar(l  +  h 

0 

(11) 

Var{x}  =  a2{r(l  +  -b)-[T(l  +  -b)}2} 

(12) 

E{xk }  =  akT(l  +  j) 

0 

(13) 

PDFs  for  selected  a’s  and  6’s  for  Weibull  distribution  axe  shown  in  Fig.  3. 


2.4  Nakagarni  Model 


In  the  foliage  penetration  setting,  the  target  returns  suffer  from  multipath  effects  corrupted 
with  fading.  As  Nakagarni  distribution  is  used  to  model  scattered  fading  signals  that  reach 
a  receiver  by  multiple  paths,  it  is  natural  to  investigate  how  well  it  fits  the  foliage  clutter 
statistics. 

The  PDF  for  Nakagarni  distribution  is  given  by 


/(i)  =  2(-)"-i-a:<2'‘  n  e  "x  ,  x  >  0,u  >  0  (14) 

uj  r(/x) 

where  /z  is  the  shape  parameter  and  u  is  the  scale  parameter.  The  mean,  variance  and  the 
moment  of  order  k  of  Nakagarni  distribution  are  given  respectively  by 


E{x} 


r(/z  + ;)  u a 
m  V 


Var{x} 


w[l  — 


1  2)  \2 

/  r(M)  ; 


(15) 

(16) 


6 


(17) 


The  PDFs  for  selected  /Ts  and  u/s  for  the  Nakagami  distribution  are  illustrated  in  Fig.  4. 


3  Experiment  Setup  and  Data  Collection 

Our  work  is  based  on  the  sense-through-foliage  data  from  Air  Force  Research  Lab  [12].  The 
foliage  penetration  measurement  effort  began  in  August  2005  and  continued  through  December 
2005.  Working  in  August  through  the  fall  of  2005,  the  foliage  measured  included  late  summer 
foliage  and  fall  and  early  winter  foliage.  Late  summer  foliage,  because  of  the  limited  rainfall, 
involved  foliage  with  decreased  water  content.  Late  fall  and  winter  measurements  involved 
largely  defoliated  but  dense  forest. 

A  bistatic  UWB  radar  (individual  transmit  and  receive  antennas)  system  was  used  and 
the  experiment  was  constructed  on  a  seven-ton  man  lift,  which  had  a  total  lifting  capacity 
of  450  kg.  The  limit  of  the  lifting  capacity  was  reached  during  the  experiment  as  essentially 
the  entire  measuring  apparatus  was  placed  on  the  lift  (as  shown  in  Fig.  5).  The  principle 
pieces  of  equipment  secured  on  the  lift  are:  Barth  puls  generator,  Tektronix  model  7704  B 
oscilloscope,  dual  antenna  mounting  stand,  two  antennas,  rack  system,  IBM  laptop,  HP  signal 
Generator,  Custom  RF  switch  and  power  supply  and  Weather  shield  (small  hut).  Throughout 
this  work,  a  Barth  pulse  source  (Barth  Electronics,  Inc.  model  732  GL)  was  used.  The  pulse 
generator  uses  a  coaxial  reed  switch  to  discharge  a  charge  line  for  a  very  fast  rise  time  pulse 
outputs.  The  model  732  pulse  generator  provides  pulses  of  less  than  50  picoseconds  (ps)  rise 
time,  with  amplitude  from  150  V  to  greater  than  2  KV  into  any  load  impedance  through  a  50 
ohm  coaxial  line.  The  generator  is  capable  of  producing  pulses  with  a  minimum  width  of  750 
ps  and  a  maximum  of  1  microsecond.  This  output  pulse  width  is  determined  by  charge  line 
length  for  rectangular  pulses,  or  by  capacitors  for  1/e  decay  pulses. 

For  the  return  data  we  used  in  this  paper,  each  sample  is  spaced  at  50  picoseconds  interval, 
and  16,000  samples  were  collected  for  each  collection  for  a  total  time  duration  of  0.8  microsec- 
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onds  at  a  rate  of  approximately  20  Hz.  We  considered  two  sets  of  data  from  this  experiment. 
Initially,  the  Barth  pulse  source  was  operated  at  lower  amplitude  and  35  pulses  of  clutter  sig¬ 
nals  were  obtained  at  each  site  but  different  time.  These  pulses  have  been  averaged  to  remove 
the  random  noise.  Data  have  been  collected  from  10  different  sites,  one  collection  of  transmit¬ 
ted  pulse  and  received  backscattering  are  shown  in  Fig.  6(a)  and  (b)  respectively.  Although 
pulse-to-pulse  variability  was  noted  for  collections  of  received  echoes,  the  fading  tendency  of 
different  returned  signals  are  the  same.  These  data  is  referred  to  as  data  set  I. 

Later,  additional  improvements  were  made  in  the  measurement  procedure,  include  the 
improved  isolation  of  transmit  and  receive  antennas,  the  addition  of  a  log-periodic  antenna 
(Antenna  Research  Associates  LPC-2010-C)  as  a  transmit  antenna,  and  the  EMCO  ridged 
waveguide  horn  (Microwave  horn,  EMCO  3106).  Echoes  for  data  set  II  were  collected  using 
this  higher  amplitude  transmitted  pulses.  2  collections  at  different  site  with  100  pulsese  average 
have  been  obtained,  one  of  which  is  shown  in  Fig.  6(c).  To  make  them  clearer  to  readers,  we 
provide  expanded  views  of  received  traces  from  sample  10,000  to  12,000  in  Fig.  7. 

4  Statistical  Analysis  of  the  Foliage  Clutter  Data 

4.1  Maximum  Likelihood  Estimation 

Using  the  collected  clutter  data  mentioned  above,  we  apply  Maximum  Likelihood  Estimation 
(MLE)  approach  to  estimate  the  parameters  of  the  log-logistic,  log-normal,  Weibull,  and  Nak- 
agami  models.  MLE  is  often  used  when  the  sample  data  are  known  and  parameters  of  the 
underlying  probability  distribution  are  to  be  estimated  [32]  [33].  It  is  generalized  as  follows: 

Let  yi,  2/2,  •  *  * ,  Vn  be  N  independent  samples  drawn  from  a  random  variable  Y  with  m 
parameters  #i,  #2,  ■  •  • ,  0m,  where  0%  E  0,  then  the  likelihood  function  expressed  as  a  function 
of  6  conditional  on  Y  is 

N 

Ln(Y\Q)  =  fY\o(yk\0\,Q2,  '  •  •  ,0m)  (18) 

k=l 
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The  maximum  likelihood  estimate  of  Q\,  62,  •  •  • ,  0m  is  the  set  of  values  6\ ,  §2 ,  • ,  6m  that 

maximize  the  likelihood  function  L^(Y\6). 

As  the  logarithmic  function  is  monotonically  increasing,  maximizing  Ljv(Y\0)  is  equiva¬ 
lent  to  maximizing  \n(L^i{Y\9)).  Hence,  it  can  be  shown  that  a  necessary  but  not  sufficient 
condition  to  obtain  the  ML  estimate  0  is  to  solve  the  likelihood  equation 

^  \n(LN(Y\0))  =  0  (19) 

Note  that  the  amplitude  of  foliage  clutter  faded  with  the  increase  of  sample  time.  Even  At 
the  same  sample,  it  varies  for  different  collections.  In  order  to  better  analyze  its  randomness, 
we  studied  each  collection.  Using  the  collected  clutter  radar  mentioned  above,  we  apply  MLE 
to  obtain  [1  and  a  for  log-logistic,  [l  and  <r  for  the  log-normal,  a  and  b  for  the  Weibull,  and  fi 
and  Co  for  the  Nakagami.  The  estimation  results  for  data  set  I  are  listed  in  table  1.  We  also 
explore  the  standard  deviation  (STD)  error  of  each  parameter.  These  descriptions  are  shown 
in  table  1  in  the  form  of  ex,  where  x  denotes  different  parameter  for  each  model.  We  also 
calculate  the  average  values  of  estimated  parameters  and  their  STD  errors  in  table  2. 

From  table  1  and  2  we  can  see  STD  error  for  log-logistic  and  log-normal  parameters  are  less 
than  0.02  and  their  estimated  parameters  vary  little  from  data  to  data  compared  to  Weibull 
and  Nakagami.  It  is  obvious  that  log- logistic  model  provides  the  smallest  STD  error  for  all  the 
10  collections  compared  to  log-normal.  It  is  obvious  that  apply  both  Weibull  and  Nakagami 
models,  accurate  shape  parameter  estimation  can  be  achieved  but  the  result  of  scale  parameter 
estimations  are  not  acceptable. 

The  estimation  results  for  data  set  II  are  shown  in  table  3.  Due  to  the  improvement  on  this 
set  of  signal,  STD  error  for  log-logistic  and  log-normal  parameters  have  been  reduced  compared 
those  of  data  set  I.  However,  for  Weibull  and  Nakagami,  it  is  a  different  case,  which  implies 
log-logistic  and  log-normal  are  much  more  accurate  to  model  foliage  clutter. 

In  the  view  of  error  on  parameter  estimation,  log-logistic  model  fits  the  collected  data  best 
compared  to  log-normal,  Weibull,  and  Nakagami.  Log-normal  is  also  acceptable. 
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4.2  Goodness-of-fit  in  curve  and  RMSE 


We  may  also  observe  the  extend  to  which  the  PDF  curve  of  the  statistic  model  matches  that 


of  clutter  data  by  calculating  the  averaged  root  mean  square  error  (RMSE)  for  each  data  set. 
Let  i  (i=l,  2,  •  •  •  ,  n)  be  the  sample  index  of  clutter  amplitude,  c*  is  the  corresponding  PDF 
value  whereas  c*  is  the  PDF  value  of  the  statistical  model  with  estimated  parameters  by  means 
of  MSE.  The  RMSE  is  obtained  through 


(20) 


Here  we  apply  n=100  for  each  model  and  k  is  the  number  of  data  collections  for  each  set. 

In  Fig.  8  and  9,  we  use  one  collection  from  data  set  I  and  II  respectively  to  illustrate 


the  goodness-of-fit  in  curve.  Also,  we  calculate  the  averaged  RMSE  of  each  model  for  both 


collected  data  set  I  and  II.  The  PDF  of  absolute  amplitude  of  one  collection  of  clutter  data 
is  presented  by  means  of  histogram  bars.  In  Fig.  8,  it  can  be  seen  obviously  that  log-logistic 
model  with  MLE  parameters  provides  the  best  goodness-of-fit  compared  to  the  other  models, 
since  it  provides  the  most  suitable  kurtosis,  slope  and  tail.  As  for  the  maximum  PDF  value,  the 
log- logistic  is  about  1  x  10-3,  while  that  of  other  models  are  over  1.2  x  10-3.  For  the  slope  part, 
which  connects  the  kurtosis  and  the  tail  and  which  is  in  the  range  from  0.1  x  104  to  0.5  x  104  in 
view  of  x  axes,  the  log-logistic  provides  the  smallest  skewness  whereas  Nakagami  provides  the 
largest.  Examination  of  the  tails  show  that  log-logistic  and  log-normal  provide  very  similar¬ 
valued  tails,  while  tails  of  the  Weibull  and  the  Nakagami  are  lager  than  the  collected  data. 
Meanwhile,  we  obtain  that  RMSE iog-iogistic  =  2.5425  x  10“5,  RMSE/o^_norma/  =  3.2704  x  10“5, 
RMSEwez6u//  =  3.7234  x  10-5,  RMSE  Nakagami  =  5.4326  x  10“5.  This  also  shows  that  the  log- 
logistic  model  is  more  accurate  than  the  other  three  models. 

Similarly,  in  Fig.  9  histogram  bars  denote  the  PDF  of  the  absolute  amplitude  of  one  col¬ 
lection  of  clutter  data  from  set  II.  Compared  to  Fig  8,  the  log-logistic  and  the  log-normal 
provide  more  similar  extend  of  goodness-of-fit.  Weibull  is  worse  since  it  cannot  fit  well 
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in  either  kurtosis  or  tail,  while  Nakagami  is  the  worst  and  unacceptable.  Also,  we  obtain 
RMSE;og_(offistic  =  2.739  x  1(T5,  RMSEiog_norma/  =  3.1866  x  10“5,  mSEWeibuU  =  3.6361  x  1(T5, 
RMSEjvafcagami  =  4.4045  x  10-5.  This  illustrates  that  for  clutter  backscattering  data  set  II,  the 
log-logistic  model  still  fits  the  best. 


5  Target  Detection  Performance 


As  we  have  mentioned  previously,  one  of  the  primary  goal  to  be  carried  out  by  a  radar  is  target 
detection.  On  a  basis  of  the  clutter  model  that  have  been  just  studied,  we  may  apply  a  special 
case  of  the  Bayesian  criterion  named  Neyman-Person  criterion  to  analyze  the  target  detection 
performance  in  the  foliage  environment. 

If  the  received  sample  signal  to  be  tested  is  R,  then  the  two  hypotheses  are  shown  as  follows: 


H0  :  R  —  C  T  n 


(21) 


H\  :  R  =  S  +  C  +  n 

where  C  and  n  represent  the  random  variable  of  clutter  and  noise  respectively.  C  follows 
log-logistic  model  with  both  parameters  p  and  cr,  and  n  is  gaussian  noise  with  zero  mean  and 
variance  v2 .  S  is  the  target  signal,  which  assumes  to  be  a  constant  for  simplicity. 

Therefore  f(R\Ho)  and  f(R\H\)  mean: 

f(R\H0)  =  PDF  of  R  given  that  a  target  was  not  present 
f(R\Hi)  =  PDF  of  R  given  that  a  target  was  present 


They  can  be  denoted  as  follows: 

roc 

f(R\H0)  = 

Jo 

f(R\Hl)=  f 
Jo 


e  ” 


ar(l  +  e‘Jirrli)2  \/2m 


1  , 
- e  2i/2  dr 


ln(r-a)-n 

e  ° 


nv 


1  — 

x  , _  e  2i/2  dr 


(22) 


(23) 


<r(r  —  s)(  1  4-  e  (  o'*  M  )2 
If  the  probability  that  a  target  was  not  present  is  P(Ho)  whereas  that  of  a  target  was 

present  is  P(Hi),  then  PDF  of  R  is 

f(R)  =  P(Ho)f(R\H0)  +  P(//1)/(i?|i/1)  (24) 
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To  decide  whether  there  is  a  target  or  not,  Ney man- Pearson  detection  rule  is  shown  as 


Ho 

f(R\Hp)  >  P(H\) 
f(R\Hi)  £  P(H o) 

In  case  of  P(H\)  =  P(Hq),  (26)  is  simplified  as 

Ho 

f(R\H0)  <  f(R\Hx) 

Hi 


(25) 


which  actually  is 


-2 s(R-r)  ,  Mf+j),  H0 

to* - '  +  — S J  > 


Inr-ji  < 

1+e  ]2 


(26) 


(27) 


r  — S  L  /w(r-3)-‘/T J 

1+e  c 


It  is  easy  to  obtain  the  decision  threshold  T  in  terms  of  the  above  function 


T  =  ~—ln[  1+,6  ~ 

s  1  +  e-^r— 


+ 


SCT 


+  -+r 


(28) 


Under  hypothesis  Ho ,  a  false  alarm  occurs  anytime  R  >  T,  therefore  the  probability  of  false 
alarm  is 


r  oo 

Pfa  =  /  /(fl|tf0)dil 

JT 

r  (fi-r)2  ,  f 

1  Z100  /'0O  e[-^r-+- 


/•OO  /*0 

Jt  J  0 


.  ini — u  .  n 

(1  +  e  »  )2r 


■drdR 


(29) 


\Z2n(jv  , 

Similarly,  Under  hypothesis  J/i,  when  R  >  T,  the  target  is  detectable.  Therefore  the  proba¬ 
bility  of  detection  is 


Pd  =  /°°/(i?|tfi)dfl 

JT 

2  /•OO  /*0 

V2tujv  Jt  Jo 


g  (  2 1/2  *  <r  J 

(1  +  e  *  )2(r  -  s) 


drdR 


(30) 


6  Conclusion 

On  a  basis  of  2  sets  of  foliage  clutter  data  using  UWB  radar,  we  show  that  it  is  more  accurate 
to  describe  the  amplitude  of  foliage  clutter  using  log-logistic  statistic  model  rather  than  log¬ 
normal,  Weibull,  or  Nakagami.  Log-normal  is  also  acceptable.  The  goodness-of-fit  for  Weibull 
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is  worse  whereas  that  of  Nakagami  is  the  worst.  Our  contribution  is  not  only  the  new  proposal 
on  the  foliage  clutter  model  with  estimated  concrete  parameters,  but  also  provide  the  criteria 
and  approaches  based  on  which  the  statistical  analysis  is  deduced.  Further,  the  theoretical 
study  about  the  probability  of  detection  as  well  as  the  probability  of  false  alarm  at  the  presence 
of  foliage  clutter  and  white  gaussian  noise  is  discussed.  Future  research  will  investigate  the 
characteristics  of  targets  and  the  design  of  radar  receiver  to  better  achieve  the  target  detection, 
tracking  and  imaging. 
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Table  1:  Estimated  Parameters  for  Data  Set  I 


PDF 

Log-Logistic 

Log-normal 

Weibull 

Nakagami 

data  1 

p  =  7.24161 
a  =  1.06483 
£M  =  0.0141212 
£(j  =  0.00724181 

p  =  7.0455 
a  =  2.20761 
£m  =  0.0174527 
e<7  =  0.0123415 

d  =  2975.33 
b  =  0.594979 

Ea  =  41.6157 
£6  =  0.00356925 

A  =  0.177062 

Cj  =  9.09663e  +  007 

Ejx  =  0.00150615 

Eu  =  1.70907e  +  006 

data  2 

p  =  6.9716 
a  =  1.2126 
=  0.014747 
£„  =  0.00773723 

p  =  6.72573 
a  =  2.33617 
£tl  =  0.0184691 
£<7  =  0.0130602 

a  =  2285.13 

b  =  0.563747 
£a  =  33.7127 
£6  =  0.00337485 

A  =  0.162375 

Cj  =  7.4776e  +  007 
£„  =  0.00137422 
eu  =  1.46679e  +  006 

data  3 

p  =  7.00554 
a  =  1.10741 
=  0.0145728 
ea  =  0.0076303 

p  =  6.76262 
<7  =  2.31258 
£tl  =  0.0182825 
£ct  =  0.0129283 

a  =  2341.52 
b  =  0.57073 
£a  =  34.1207 
£b  =  0.00341448 

A  =  0.164695 
u  =  7.46366e  +  007 
£m  =  0.001395 
£u  =  1.45459e  +  006 

data  4 

p  =  7.03055 
a  =  1.07858 
=  0.0142027 
=  0.00741556 

A  =  6.80711 
a  =  2.25973 
£n  =  0.0178647 
£ff  =  0.0126329 

a  =  2395.85 
b  =  0.579381 
£a  =  34.4066 
£6  =  0.00345156 

A  =  0.167391 

Cj  =  7.4926e  +  007 

Ep  =  0.0014916 
ew  =  1.44727e  +  006 

data  5 

p  =  7.16226 
a  =  1.10132 
£m  =  0.014605 
£ct  =  0.00750067 

A  =  6.95712 
a  =  2.26592 
£m  =  0.0179137 
£<7  =  0.0126675 

a  =  2806.76 
b  =  0.577823 
£a  =  40.4226 
£6  =  0.00347389 

A  =  0.17112 
u  =  9.03298e  +  007 
=  0.00145265 
£w  =  1.72749e  +  006 

data  6 

p  =  7.01527 
a  =  1.10123 
£„  =  0.0144902 
£a  =  0.00758568 

A  =  6.77515 
a  =  2.30286 
£„  =  0.0182057 
£a  =  0.012874 

a  =  2360.33 

b  =  0.572749 
£„  =  34.2753 
£b  =  0.00342376 

A  =  0.165292 

Cj  =  7.50824e  +  007 
£m  =  0.00140035 
eu  =  1.46145e  +  006 

data  7 

p  =  7.14523 
&  =  1.09486 
£m  =  0.0145132 
£<7  =  0.00745994 

A  =  6.94201 
a  =  2.25621 
£^  =  0.0178369 
£a  =  0.0126132 

d  =  2753.69 
b  =  0.578948 
£a  =  39.585 
£b  =  0.00347442 

A  =  0.170964 

Cj  =  8.80474e  +  007 

Eu  =  0.00145125 
eu  =  1.68382e  +  006 

data  8 

A  =  6.95411 
&  =  1.11486 
£m  =  0.0146774 
£<7  =  0.00768003 

A  =  6.71591 
a  =  2.31898 
£m  =  0.0183331 
£CT  =  0.0129641 

a  =  2250.66 

b  =  0.564989 
£0  =  33.1387 

Eb  =  0.0033763 

A  =  0.162448 

Cj  =  7.31436e  +  007 
=  0.00137488 
eu  =  1.4338e  +  006 

data  9 

p  =  7.18561 
a  =  1.09854 
£^  =  0.0145483 
£a  =  0.00749265 

A  =  6.9715 
a  =  2.27088 
£fi  =  0.0179529 
£<7  =  0.0126952 

a  =  2840.72 
b  =  0.581219 

Ea  =  40.6593 
£6  =  0.0034984 

A  =  0.172324 

Cj  =  8.97304e  +  007 
£m  =  0.00146348 
ew  =  1.70923e  +  006 

data  10 

(i  =  7.192 

d  =  1.0866 
=  0.0144166 
=  0.0073916 

fi  =  6.99196 
d  =  2.23975 
=  0.0177067 
=  0.0125211 

a  =  2869.65 

b  =  0.584803 
£a  =  40.837 
£b  =  0.00351294 

A  =  0.173572 

Cj  =  9.01631e  +  007 
£p  =  0.0014747 
su  =  1.71 142e  +  006 
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Table  2:  Averaged  Estimated  Parameters  for  Data  Set  I 


PDF 

Log-Logistic 

Log-normal 

Weibull 

Nakagami 

average 

A  =  7.0904 
<5-  =  1.1061 
£ ,,  =  0.0145 
£„  =  0.0075 

A  =  6.8695 
a  =  2.2771 

En  —  0.0180 

=  0.0127 

(i  =  2588 
b  =  0.5769 
ea  =  37.4316 

Eb  =  0.0035 

A  =  0.1687 
u  =  8.218e  +  007 
=  0.0014 

Efjj  —  1.4905e  T  006 

Table  3:  Estimated  and  Averaged  Parameters  for  Data  Set  II 


PDF 

Log-Logistic 

Log-normal 

Weibull 

Nakagami 

data  1 

A  =  7.76868 
a  =  0.786511 

En  =  0.0107792 

Ea  =  0.00521601 

A  =  7.79566 
a  =  1.41771 
£m  =  0.011208 

Ea  =  0.00792559 

a  =  4901.07 

b  =  0.743223 
£a  =  55.3011 
£b  =  0.00434465 

A  =  0.239587 
u>  =  1.16839e  +  008 

Efi  =  0.00207912 
eu  =  1.88719e  +  006 

data  2 

A  =  7.78096 
a  =  0.787426 
£m  =  0.0107917 

Ea  =  0.0052213 

A  =  7.8046 
<7  =  1.41855 
=  0.0112147 
ea  =  0.00793033 

a  =  4942.48 

b  =  0.745233 
£a  =  55.6114 
£b  =  0.0043612 

A  =  0.240593 
w  =  1.17237e  +  008 
e^i  =  0.00208848 

Eu,  =  1.88953e  +  006 

average 

A  =  7.7748 
a  =  0.7870 
£m  =  0.0108 

Ea  =  0.0052 

A  =  7.7881 
a  =  1.4181 
£m  =  0.0112 

Ea  =  0.0079 

a  =  4921.8 

b  =  0.7442 
£a  =  55.4565 
£b  =  0.0044 

A  =  0.2401 
u)  =  1.1704  +  008 

Eu  =  0.0021 

Eu,  =  1.8884  +  006 
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Figure  5:  This  figure  shows  the  lift  with  the  experiment.  The  antennas  are  at  the  far  end 
of  the  lift  from  the  viewer  under  the  roof  that  was  built  to  shield  the  equipment  from  the 
elements.  This  picture  was  taken  in  September  with  the  foliage  largely  still  present.  The 
cables  coming  from  the  lift  are  a  ground  cable  to  an  earth  ground  and  one  of  4  tethers  used  in 
windy  conditions. 
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Figure  6:  Clutter  data  (a)transmitted  pulse  before  antenna  amplification  (b)  an  example  of 
received  echoes  from  data  set  I,  and  (c)  an  example  of  received  echoes  from  data  set  II 
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Figure  7:  Expanded  view  from  clutter  samples  10,000  to  12,000  (a)  from  data  set  I,  and  (b) 
from  data  set  II 
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Figure  9:  Clutter  model  comparison  from  data  set  II  (a)  log-Logistic  vs.  log-normal,  and 
(b)  log-logistic  vs.  Weibull  (c)  log-logistic  vs.  Nakagami.  RMSEio9_io9istic  =  2.739  x  1CT5, 
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Abstract —  In  this  paper,  we  study  sense-through-foliage  target 
detection.  When  radar  echoes  are  in  good  quality,  the  detection  of 
target  can  be  achieved  by  applying  short  time  Fourier  transform 
(STFT)  to  the  received  UWB  radar  waveform.  We  compared 
our  approach  in  case  of  no  target  as  well  as  with  target  against 
the  scheme  in  which  2-D  image  was  created  via  adding  voltages 
with  the  appropriate  time  offset.  Results  show  that  our  approach 
can  detect  target  more  easily.  When  radar  echoes  are  in  poor 
condition  and  single  radar  is  unable  to  carry  out  the  detection,  we 
employ  both  Radar  Sensor  Networks  (RSN)  and  RAKE  structure 
to  combine  the  echoes  from  different  radar  members  and  finally 
detect  the  target. 

I.  Introduction 

Detection  and  identification  of  military  equipment  in  a 
strong  clutter  background,  such  as  foliage,  soil  cover  or 
building  leads  has  been  a  long-standing  subject  of  intensive 
study.  It  is  believed  that  solving  the  target  detection  through 
foliage  will  significantly  benefit  sense-through-wall  and  many 
other  subsurface  sensing  problems.  However,  to  this  date,  the 
detection  of  foliage-covered  military  targets,  such  as  artillery, 
tanks,  trucks  and  other  weapons  with  the  required  probability 
of  detection  and  false  alarm  still  remains  a  challenging  issue. 
This  is  due  to  the  following  facts: 

1)  Given  certain  low  radar  cross  section(RCS),  scattering 
from  tree  trunk  and  ground  reflectivity  may  largely 
overwhelm  the  returned  target  signals  of  interest,  thus 
very  high  multiple  fading  may  severely  corrupt  the 
amplitude  and  phase  of  the  echoes. 

2)  Even  if  the  target  is  stationery,  tree  leaves  and  branches 
are  likely  to  swing  in  result  of  gust,  which  will  result  in 
doppler  shift  of  clutter  and  difficulty  of  target  detection. 

Over  the  past  two  decades,  following  3  types  of  signals  have 
been  mainly  studied  to  examine  the  performance  on  target 
detection  in  foliage: 

1)  Traditional  sinusoidal  waveforms  at  VHF  through  UHF 
bands  [1  ],  as  the  lower  the  radar  frequency,  the  lower  the 
attenuation  and  scattering  from  branches  and  trees,  and 
thus  better  penetration  through  foliage.  However,  these 
approaches  result  in  low  resolution  and  low  RCS. 


2)  Millimeter-Wave  (MMW)  radars  are  used  in  [2]  [3] 
and  [4].  Results  demonstrate  the  potential  for  satisfying 
performance  but  need  further  investigation. 

3)  Relatively  low  frequency  Ultra- wide  band  (UWB)  radars 
between  100  MHZ  and  3  GHz  are  frequently  employed 
in  recent  years  owning  to  the  characteristics  provided  by 
their  high  resolutions  as  well  as  the  very  good  ability  of 
penetration,  such  as  penetrating  walls  [5]  [6].  Despite 
comparatively  short  detection  range,  UWB  signal  would 
have  advantages  over  a  narrowband  signal  with  limited 
frequency  content. 

In  this  paper,  we  will  apply  our  expertise  in  signal  process¬ 
ing,  data  fusion,  radar  sensor  networks  (RSN)  etc.  to  achieve 
effective  target  detection  in  foliage  using  ultra-wideband 
(UWB)  radar. 

The  remainder  of  this  paper  is  organized  as  follows.  In 
Section  II,  we  summarize  the  measurement  and  collection  of 
data  used  in  this  study.  In  Section  III,  we  propose  a  short  time 
Fourier  transform  (STFT)  based  approach  for  through-foliage 
target  detection  when  the  signal  quality  is  good.  In  Section 
IV,  we  propose  RSN  and  RAKE  structure  for  target  detection 
in  foliage  when  the  signal  quality  is  poor.  We  conclude  this 
paper  and  discuss  some  future  research  topics  in  Section  V. 

II.  Data  Measurement  and  Collection 

Our  work  is  based  on  the  foliage  penetration  field  test  data 
from  Air  Force  Research  Lab  [7].  The  foliage  penetration 
measurement  effort  began  in  August  2005  and  continued 
through  December  2005.  Working  in  August  through  the  fall 
of  2005,  the  foliage  measured  included  late  summer  foliage 
and  fall  and  early  winter  foliage.  Late  summer  foliage,  because 
of  the  limited  rainfall,  involved  foliage  with  decreased  water 
content.  Late  fall  and  winter  measurements  involved  largely 
defoliated  but  dense  forest. 

The  foliage  experiment  was  constructed  on  a  seven-ton  man 
lift,  which  had  a  total  lifting  capacity  of  450  kg.  The  limit 
of  the  lifting  capacity  was  reached  during  the  experiment 
as  essentially  the  entire  measuring  apparatus  was  placed  on 
the  lift.  The  principle  pieces  of  equipment  secured  on  the 
lift  are:  Barth  pulser,  Tektronix  model  7704  B  oscilloscope. 


dual  antenna  mounting  stand,  two  antennas,  rack  system,  IBM 
laptop,  HP  signal  Generator,  Custom  RF  switch  and  power 
supply  and  Weather  shield  (small  hut). 

The  target  is  a  trihedral  reflector  with  a  slant  length  of  1.5 
meters(as  shown  in  Fig.  1).  It  is  250  feet  one-way  distance 
from  the  antenna.  Throughout  this  work,  a  Barth  pulse  source 
(Barth  Electronics,  Inc.  model  732  GL)  was  used.  The  pulse 
generator  uses  a  coaxial  reed  switch  to  discharge  a  charge  line 
for  a  very  fast  rise  time  pulse  outputs.  The  model  732  pulse 
generator  provides  pulses  of  less  than  50  picoseconds  (ps)  rise 
time,  with  amplitude  from  150  V  to  greater  than  2  KV  into  any 
load  impedance  through  a  50  ohm  coaxial  line.  The  generator 
is  capable  of  producing  pulses  with  a  minimum  width  of  750 
ps  and  a  maximum  of  1  microsecond.  This  output  pulse  width 
is  determined  by  charge  line  length  for  rectangular  pulses,  or 
by  capacitors  for  1/e  decay  pulses. 


Fig.  1.  The  target  (a  trihedral  reflector)  is  shown  on  the  stand  at  300  feet 
from  the  lift. 

For  the  data  we  used  in  this  paper,  each  sample  is  spaced  at 
50  picosecond  interval,  and  16,000  samples  were  collected  for 
each  collection  for  a  total  time  duration  of  0.8  microseconds. 
We  considered  two  sets  of  data  from  this  experiment.  Initially, 
the  Barth  pulse  source  was  operated  at  low  amplitude  and 
significant  pulse-to-pulse  variability  was  noted  for  these  col¬ 
lections.  We  refer  this  set  of  collections  as  “poor”  signal.  The 
scheme  for  the  target  detection  in  foliage  with  “poor”  signal 
quality  will  be  presented  in  Section  IV.  Later,  good  signal 
quality  data  were  collected  using  higher  amplitude  pulses  and 
100  pulses  reflected  signals  were  averaged  for  each  collection. 
The  scheme  for  target  detection  with  “good”  signal  quality  will 
be  presented  in  Section  III. 

III.  Target  Detection  with  Good  Signal  Quality: 

A  STFT-based  approach 

In  Fig.  2,  we  plot  two  collections  with  good  signal  quality, 
one  without  a  target  on  range  (Fig.  2a)  and  the  other  one  with 
a  target  on  range  (Fig.  2b  and  target  appears  at  around  sample 
14,000).  To  make  it  more  clear  to  the  readers,  we  provide 
expanded  views  of  traces  (with  target)  from  sample  13,001  to 
15,000  for  the  above  two  collections  in  Figs.  3a  and  3b.  The 


target  response  will  be  the  echo  difference  between  Fig.  3b  and 
Fig.  3a,  which  is  plotted  in  Fig.  3c.  However,  in  a  practical 
situation  we  either  obtain  Fig.  3a  (clutter  echo  without  target) 
or  Fig.  3b  (target  on  range)  without  the  knowledge  about  the 
presence  of  a  target.  The  challenge  is  how  can  we  make  target 
detection  only  based  on  Fig.  3b  (with  target)  or  Fig.  3a  (no 
target)? 
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Fig.  2.  Measurement  with  very  good  signal  quality  and  100  pulses  integration, 
(a)  no  target  on  range  (b)  with  target  on  range  (target  appears  at  around  sample 
14000) 

To  solve  this  problem,  a  scheme  is  previously  proposed  in 
[8],  where  2-D  image  was  created  via  adding  voltages  with  the 
appropriate  time  offset.  In  Figs.  4(a)  and  4(b),  we  plot  the  2-D 
image  created  based  on  the  above  two  data  sets  (from  samples 
13,800  to  14,200)  using  the  approach  in  [8].  However,  from 
these  two  figures,  it’s  not  obvious  that  which  image  shows 
there  is  a  target  on  range. 

Although  Fig.  3a  and  3b  look  quite  similar  at  first  sight, 
after  careful  observation,  it  is  not  difficult  to  find  that  the 
sample  strength  change  more  abruptly  where  target  appears 
(around  sample  14000),  which  implies  that  echo  from  tar¬ 
get  contains  more  AC  values  than  that  without  target.  This 
phenomenon  inspires  the  application  of  short  time  Fourier 
transform  (STFT),  that  is  using  a  slide  window  to  determine 
the  sinusoidal  frequency  and  phase  content  of  a  signal  as  it 
changes  over  time.  This  form  of  the  Fourier  transform  has 
a  great  many  applications  in  sonar  and  radar  processing.  We 
will  show  that  STFT-based  approach  is  able  to  make  target 
detection  more  intuitively  and  easily. 


Fig.  3.  Measurement  with  good  signal  quality  and  100  pulses  integration 
(a)  Expanded  view  of  traces  (no  target)  from  samples  13001  to  15000  (b) 
Expanded  view  of  traces  (with  target)  from  samples  13001  to  15000  (c) 
Expanded  view  of  traces  difference  between  with  and  without  target 
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Fig.  4.  2-D  image  created  via  adding  voltages  with  the  appropriate  time 

offset  (a)  no  target  (b)  with  target  in  the  field 


values  (m  >  4) 

L  —  1 

p(m)  =1  F(m’w)2  I  (2) 

iu=4 

We  plot  the  power  of  AC  values  P(m )  versus  time  domain 
sample  index  in  Fig.  5a  and  5b  for  the  data  sets  in  Figs.  2a 
and  2b  respectively.  We  can  see  that  at  the  samples  where 
there  is  a  target,  the  curve  of  the  power  signal  looks  like  a 
Gaussian  pdf  other  than  chaotic  impulses.  And  thus  it  is  quite 
straightforward  to  see  that  there  is  no  target  on  range  in  Fig. 
5a. 

It’s  worth  mentioning  that  for  better  visual  inspection, 
window  length  and  step  size  may  change  on  a  basis  of  different 
radar  data. 


The  discrete  STFT  can  be  expressed  as 

N- 1 

F(m,w)  =  ^  r(n)w(n  -  m)e~]wn  (1) 

n= o 

where  r(n)  is  radar  measurement  and  w(n)  is  the  window 
function.  We  apply  rectangular  window,  with  its  length  L  =  30 
and  step  size  M  =  16.  Then  we  cumulate  the  power  of  AC 


IV.  Target  Detection  with  Poor  Signal  Quality: 

Radar  Sensor  Network  and  differential-based 
approach 

As  mentioned  in  Section  II,  when  the  Barth  pulse  source 
was  operated  at  low  amplitude  and  the  sample  values  are 
not  obtained  based  on  sufficient  pulse  response  averaging 
(averaged  over  35  pulses  for  each  collection),  significant  pulse- 
to-pulse  variability  was  noted  and  the  return  signal  quality  is 
poor.  Fig.  6  illustrate  the  received  echoes  in  this  situation. 


(a) 


Fig.  5.  The  power  of  AC  values  versus  sample  index,  (a)  no  target  (b)  with 
target  in  the  field 

Even  with  the  application  of  the  STFT-based  scheme  proposed 
above,  we  can  not  tell  whether  there  is  a  target  or  not  in  the 
range.  Since  pulse-to-pulse  variability  exists  in  the  echoes  at 
different  time  or  different  site,  this  motivate  us  to  explore  the 
spatial  and  time  diversity  using  Radar  Sensor  Networks  (RSN). 

In  nature,  a  network  of  multiple  radar  sensors  can  been  uti¬ 
lized  to  combat  performance  degradation  of  single  radar  [10]. 
These  radar  sensors  are  managed  by  an  intelligent  clusterhead 
that  combines  waveform  diversity  in  order  to  satisfy  the  com¬ 
mon  goals  of  the  network  other  than  each  radar  operate  sub¬ 
stantively.  As  radar  sensors  are  environment  dependent  [11], 
it  may  provide  better  signal  quality  if  different  radars  work 
collaboratively  to  perform  data  fusion.  For  example,  consider 
a  system  of  two  radars.  When  the  signal  of  either  radar 
unfortunately  experience  a  severe  fading,  if  two  radars  are 
spaced  sufficiently  far  apart,  it  is  not  likely  that  both  of  the 
radars  experience  deep  fade  at  the  same  time.  By  selecting 
better  waveform  from  the  two  radar  waveforms,  the  data  is 
less  likely  to  be  lost. 

In  this  paper,  we  assume  the  radar  sensors  are  synchronized 
in  RSN  and  we  employed  RAKE  structure  to  combine  received 
information  for  RSN.  The  detailed  process  is  shown  in  Fig. 
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Fig.  6.  Measurement  with  poor  signal  quality  (a)  Expanded  view  of  traces 
(no  target)  from  samples  13001  to  15000  (b)  Expanded  view  of  traces  (with 
target)  from  samples  13001  to  15000  (c)  Expanded  view  of  traces  difference 
between  with  and  without  target 

8.  The  echo,  i.e.,  RF  response  by  the  pulse  of  each  cluster- 
member  radar  sensor,  will  be  combined  by  the  clusterhead 
using  a  weighted  average,  and  the  weight  A*  is  determined  by 
the  power  of  each  echo  Xi(m)  (m  is  the  sample  index), 

A  =  v^-ET  (3) 


Ei  =  var(xj(m))  +  [mean(ii(m))]2 


(4) 
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Fig.  7.  The  power  of  AC  values  versus  sample  index,  (a)  no  target  (b)  with 
target  in  the  field 

As  for  STFT,  we  apply  window  length  L  =  25  and  step  size 
M  =  15.  We  ran  simulations  for  n  =  35  and  plot  the  power 
of  combined  signal  obtained  through  STFT  based  approach  in 
Fig.  7.  Compare  this  figure  with  Fig.  6a  and  Fig.  6b,  it  is  quite 
obvious  to  see  that  there  is  a  target  around  sample  14,000. 

V.  Conclusion  and  Future  Works 

In  this  paper,  we  propose  a  short  time  Fourier  transform 
(STFT)-based  signal  processing  approach  on  received  UWB 
Radar  waveforms  to  improve  through-foliage  target  detection. 
The  foliage  penetration  measurements  were  taken  in  Holliston, 
Massachusetts.  When  radar  echoes  are  in  good  quality,  the 
detection  of  target  can  be  achieved  by  applying  STFT-based 
technology  to  single  radar.  We  compared  our  approach  in  case 
of  no  target  as  well  as  with  target  against  the  scheme  in 
which  2-D  image  was  created  via  adding  voltages  with  the 
appropriate  time  offset.  Results  show  that  our  approach  can 
detect  target  more  intuitively.  When  radar  echoes  are  in  poor 
condition  and  single  radar  is  unable  to  carry  out  detection, 
we  employe  both  Radar  Sensor  Networks  (RSN)  and  RAKE 
structure  to  combine  the  echoes  from  different  radar  members 
and  finally  successfully  detect  the  target.  For  future  works. 


At 


Fig.  8.  Block  diagram  of  STFT-based  approach  and  diversity  combination 
in  RSN 

we  will  collect  more  data  with  different  targets  and  perform 
automatic  target  recognition  besides  target  detection. 
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Abstract 

Energy  conservation  is  critical  in  Wireless  Sensor 
Networks.  Replacing  or  recharging  batteries  is  not  an 
option  for  sensors  deployed  in  hostile  environments. 
Generally  communication  electronics  in  the  sensor  utilizes 
most  energy.  This  paper  studies  the  effect  of  changing  the 
transmission  power  and  baud  rate  on  transmission 
distance.  Using  Shannon  channel  capacity  formula  and 
Log  -  Distance  Path  Loss  Model ,  transmission  distance  is 
shown  to  be  related  to  transmit  power  and  baud  rate. 
Extensive  empirical  readings  are  taken  to  confirm  the 
above  relation.  The  path  loss  exponent  got  as  a  result  of 
data  fitting  is  within  the  acceptable  range  for  wireless 
environment.  Using  the  equation  derived  in  this  paper ,  the 
distance  between  neighboring  motes  and  traffic  density  it 
will  be  possible  for  sensors  to  adjust  their  transmit  power 
and  baud  rate  so  as  to  use  only  the  required  amount  of 
energy  to  maintain  the  wireless  link  to  the  neighbor  and 
conserve  power. 

I.  Introduction 

Wireless  Sensor  Networks  (WSN)  are  comprised  of 
small,  inexpensive  sensors  with  wireless  communication 
capabilities,  called  motes.  They  are  deployed  in  ad  -  hoc 
networks  and  are  powered  by  limited  power  supplies. 
These  motes  are  deployed  in  large  numbers  and  provide 
unprecedented  opportunities  for  instrumenting  and 
controlling  homes,  cities  and  the  environment.  They  find 
applications  in  different  fields  like  military  sensing, 
physical  security,  air  traffic  control,  traffic  monitoring, 
video  surveillance,  industrial  automation  etc.  Each  poses 
different  challenges  for  these  motes  but  one  common 
challenge  faced  in  all  fields  is  power  conservation.  This  is 
because  motes  are  sometimes  deployed  in  difficult  to 
reach  regions  and  this  makes  it  difficult  to  replace  the 
batteries.  Hence  power  conservation  becomes  an 
important  factor  for  these  motes.  One  of  the  main  reasons 
for  deploying  these  motes  in  ad  -  hoc  is  power 
conservation.  Power  is  consumed  during  data  processing 
and  RF  communication,  but  communication  electronics 
uses  far  more  power  than  processing.  Hence  a  lot  of  effort 
goes  into  designing  energy  efficient  routing  algorithms 
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[1,  2,  3],  directed  diffusion  algorithms  [4,  5],  clustering 
algorithms  [6,  7,  8],  data  aggregation  [9]  and  MAC 
protocols  [10,  11,  12].  Many  of  these  algorithms  assume 
fixed  transmit  power  and  baud  rate. 

In  this  paper  we  study  the  effect  of  changing  the 
transmit  power  and  baud  rate  on  transmission  distance. 
This  study  is  particularly  useful  for  WSN’s  which  are 
mobile,  such  as  in  applications  involving  robotic  swarms 
[19].  We  experimentally  determine  a  relation  between 
distance,  transmit  power  and  baud  rate  for  Crossbow’s 
Mica2  motes  deployed  in  indoor  environment.  This  can  be 
incorporated  along  with  the  above  algorithms  and  with 
knowledge  of  the  distance  to  neighboring  mote  and  the 
traffic  density  can  be  used  to  adjust  the  transmit  power 
and  baud  rate  so  that  packets  can  be  forwarded  to  the 
neighboring  mote  with  the  least  energy  and  maximum 
reliability.  Adjusting  the  transmit  power  the  neighboring 
mote  will  fall  within  the  transmission  range  and  neither 
will  be  out  of  range  nor  will  be  the  transmission  range 
much  bigger  than  the  desired  distance.  This  will  also  help 
to  keep  packet  exchange  to  the  desired  mote  and  not  to  any 
distant  mote,  which  can  itself  start  some  other  packet 
exchange  and  so  will  help  keep  interference  to  a 
minimum.  This  study  along  with  the  energy  efficient 
algorithms  will  further  increase  energy  efficiency  in  real 
time  when  deployed  on  Mica2  motes. 

This  paper  is  organized  as  follows.  In  section  II  we 
briefly  describe  the  factors  affecting  transmission 
distance.  Section  III  gives  the  details  of  our  experiment 
and  the  empirical  data  plots  obtained.  In  section  IV  curve 
and  surface  fitting  is  described  and  lastly  in  section  V  we 
provide  a  conclusion  of  our  study. 

II.  Transmission  Distance 

The  study  of  Transmission  Distance,  Transmit  Power 
and  Baud  Rate  done  for  this  paper  is  related  to  Energy 
Conservation  in  Wireless  Sensor  Network.  Transmit 
Power  and  Baud  Rate  together  affect  the  Transmission 
Distance.  In  a  wireless  channel  there  is  some  power  loss 
called  Path  Loss.  This  path  loss  is  inversely  proportional 
to  the  distance  between  the  transmitter  and  receiver.  With 
the  knowledge  of  distance  and  path  loss  we  can  find  the 
exact  transmit  power  required  to  maintain  a  good  link  to 
the  receiver  without  wasting  any  energy. 
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Another  factor  that  affects  the  transmission  distance  is 
the  antenna  sensitivity.  Baud  rate  of  the  data  affects  the 
receiver  antenna  sensitivity.  This  is  because  at  the  higher 
baud  rate,  there  is  less  energy  and  a  fewer  number  of 
actual  radio  waves  in  each  bit  of  information  transmitted. 
The  lower  baud  rates  have  more  energy  per  bit  and  more 
actual  radio  waves  per  bit  transmitted.  The  more  radio 
waves  received  per  bit  makes  it  much  easier  to  establish 
the  correct  waveform.  Thus,  lower  baud  rates  mean  more 
energy  per  bit;  better  receive  signals  and  longer 
transmission  distances  [13].  The  advantage  is  by  keeping 
the  transmit  power  constant  and  only  decreasing  the  baud 
rate  we  can  increase  the  transmission  distance  of  the  radio. 

III.  Experiment  Details 

The  following  hardware  was  used  for  the  experiment 

1.  MIB510  Programming  board  manufactured  by 
Crossbow  Technology. 

2.  Mica2  motes  2  nos.  manufactured  by  Crossbow 
Technology 

For  the  radio  communications  Mica2  uses  the  CC1000 
Chip  which  works  in  the  433  MHz  band.  The  usual 
antenna  chosen  is  a  length  of  insulated  wire  called  the 
monopole  whip  antenna  one  -  quarter  wavelength 
long.  This  happens  to  be  6.8  inches  for  the  433  MHz 
Mica2  [14].  Mica2  is  powered  by  two  AA  batteries 
and  requires  a  voltage  between  2.7  -  3.3  Volts  for 
successful  operation. 

A.  CC 1 000  transceiver  details 
CC1000  uses  the  Binary  Frequency  Shift  Keying 
(BFSK)  modulation  in  the  physical  layer.  Its  transmit 
power  and  the  baud  rate  are  completely  programmable. 
The  crystal  oscillator  connected  has  a  frequency  of 
14.7456  MHz.  There  are  22  inbuilt  8  -  bit  registers  which 
can  be  used  to  program  the  operating  parameters.  A  few 
relevant  parameters  are  as  follows. 

Frequency  Control  Word  A  which  sets  the  local 
oscillator  frequency  in  receive  mode  and  Frequency 
Control  Word  B  which  sets  the  transmitting  frequency,  f0 
in  the  transmit  mode,  together  operate  the  frequency 
synthesizer  (PLL).  In  the  experiment  Word  A  had  a  value 
of  0x580000  and  B  had  a  value  of  0x57F685.  Two 
registers  FSEP0  and  1  are  used  to  set  the  frequency 
separation  and  they  hold  a  value  of  0x355.  Transmit  and 
receive  frequencies  can  be  calculated  using  the  following 
formula  [15]. 

r  _  r  FREQ  +  {FSEP  xTXDA 7M)  +  8192  m 

Jno  -  /-/*  16384 

Here y;ro  gives  the  Local  Oscillator  (LO)  frequency  in 
receive  mode  and  the  f0  and  fj  frequency  in  transmit  mode 


(lower  and  upper  FSK  frequency).  /  is  the  reference 

frequency  calculated  using  the  formula  given  below. 
FREQ  is  the  value  in  frequency  control  word  A  or  B 
according  to  whether  the  formula  is  used  to  calculate 
receive  or  transmit  frequency.  FSEP  is  the  value  in  the 
FSEP0  and  1  registers.  TXDATA  is  0  or  1  in  transmit  mode 
depending  on  the  data  bit  to  be  transmitted.  In  the  receive 
mode  TXDATA  is  always  0.  Now  to  calculate  fnj  we  use 

the  following  formula  [15] 

/  =  f*osc  (2) 

nf  REFDIV 

fxosc  ls  the  crystal  oscillator  frequency  of  14.7456  MHz 
and  REFDIV  is  set  to  12.  This  gives  fn/  =  1 .2288  MHz. 

Using  all  these  values  in  eq.  1,  in  receive  mode 
fvco  =433. 152x1 06  Hz  and  in  transmit  mode 


fvco  =  432.96997  xl06//z. 


Thus  the  LO  frequency  in  the  receive  mode  is 
433.152  MHz.  In  the  experiments  we  have  used  a  high 
side  LO  injection.  So  fVco  =  fRF  +  fiF  where  fRF  is  the 
centre  frequency  and  fiF  is  the  Intermediate  frequency.  For 
CC1000  fIF  is  designed  to  be  150.0375  KHz.  So  we  get 
fRF  =  433.0019625  MHz. 


The  upper  FSK  transmit  frequency  is  given  by 
fi  =  fo  +  fSep-  fo  is  the  fvco  frequency  calculated  above  for 
transmit  mode  i.e.  432.96997  MHz  and  fscp  is  the 
frequency  separation  calculated  using  the  formula  [15] 
given  below. 


fsep  f r 


rtf  ' 


FSEP 

16384 


(3) 


Substituting  the  values  /  =  63975//z,  further  using  this 


fsep  we  get  fj  =  433.03395  MHz.  From  f0  and  f\  the  centre 
frequency  will  be  433.0019625  MHz  and  is  same  as  in  the 
receive  mode. 

The  transmit  power  of  CC1000  can  be  varied  from 
-20  dBm  to  10  dBm  in  steps  of  1  dB  and  is  controlled  by 
the  PA  POW  register.  We  carried  out  our  experiments  at 
four  different  power  level  setting  as  shown  in  Table  1 . 

Another  parameter  that  was  required  for  our 
experiments  is  the  baud  rate.  This  is  set  using  a  part  of 
MODEMO  register  as  shown  in  Table  2.  For  our 
experiments  we  used  19.2,  38.4  and  76.8  kBaud. 
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Table  1:  PA_POW  register  values  and  corresponding 
output  power  [15]  _ 


Output  Power 
(dBm) 

PA  POW  Register  Value 
(hex) 

-20 

01 

-19 

01 

-18 

02 

-17 

02 

-16 

02 

-15 

03 

-14 

03 

-13 

03 

-12 

04 

-11 

04 

Table  2:  MODEMQ  register  part  to  set  Baud  Rate 


Register 

Part  Name 

Bit  &  Parameter  Value 

MODEMO{6:4> 

BAUDRATE{2:0} 

000:  0.6  kBaud 

001:  1.2  kBaud 

010:  2.4  kBaud 

011:  4.8  kBaud 

100:  9.6  kBaud 

101:  19.2,38.4  and  76.8  kBaud 
110:  Not  Used 

111:  Not  Used 

15] 


The  experiment  was  carried  out  in  a  big  warehouse 
which  was  used  as  a  lab  with  minimal  office  furniture.  The 
Mica2  motes  were  placed  on  the  ground  and  were  always 
in  line  of  sight  of  each  other.  The  following  picture  was 
taken  during  the  experiment. 


(a)  (b) 

Figure  1 :  Experimental  Site  (a)  Front  view  (b)  Top  view  of 
motes 

For  the  transmitter  mote  a  certain  power  level  and  baud 
rate  is  set  from  the  possible  options  available.  Since  the 
wireless  channel  varies  a  lot  randomly  with  respect  to  time 
there  will  be  some  packet  loss  even  when  the  receiver 
mote  is  within  the  transmission  distance.  Hence  the 
transmitter  mote  sends  a  total  of  4000  packets,  so  as  to 
follow  Monte  Carlo  method,  each  with  a  sequence 
number.  The  receiver  mote  keeps  a  record  of  the  number 
of  packets  received  and  missed  and  finally  transmits  a 


summary  packet  to  the  computer  attached  to  it.  A 
resolution  of  5  inches  was  used  for  the  distance  between 
the  transmitter  and  receiver.  Once  it  is  found  that  the 
receiver  is  not  receiving  any  packets  at  all  we  know  that 
the  receiver  is  out  of  the  transmission  range.  When  it  is 
absolutely  sure  the  receiver  is  out  of  range  the  transmit 
power  or  baud  rate  on  the  transmitter  mote  are  changed  to 
a  different  value  and  the  above  experiment  carried  again. 
The  plots  in  Figure  2  were  generated  using  the  data 
gathered  from  the  experiment. 


(b) 
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(C) 


Oaianct  m  (inch**) 


(d) 

Figure  2:  Packet  success  in  percentage  as  a  function  of 
transmission  distance  for  power  levels  of  (a)  -19.5  dBm, 
(b)  -17  dBm,  (c)  -14  dBm,  (d)  -1 1.5  dBm  and  for 
19.2  kBaud,  38.4  kBaud  and  76.8  kBaud 

Each  curve  from  the  plots  in  Figure  2  is  initially  in  the 
region  of  90  -  100  %  this  is  because  the  transmitter  and 
receiver  motes  are  within  the  transmission  range.  A  few 
packets  are  dropped  but  this  could  be  because  of  variations 
in  the  channel  that  occur  for  a  very  small  amount  of  time 
and  is  normal  for  any  wireless  channel.  As  the  distance 
between  the  transmitter  and  receiver  is  increased,  after  a 
certain  distance  the  curve  drops  drastically  and  the  packets 
received  are  in  the  range  of  0  -  5  %.  This  implies  the 
transmitter  is  out  of  range  of  the  receiver  mote. 

IV.  Data  Fitting 

The  empirical  data  is  thought  to  be  derivable  from  some 
underlying  function.  The  plots  in  section  III  seem  to 
closely  follow  the  Q  -  function.  We  used  curve  fitting 
using  method  of  least  squares  to  minimize  the  sum  of 


squared  deviations.  After  fitting  Q  -  function  to  each 
curve  following  plots  were  generated.  All  curve  fits  are 
not  shown  in  Figure  3  but  the  data  got  from  all  curve  fits  is 
shown  in  Table  3. 
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Figure  3:  Q-function  curve  fit  at  19.2  kBaud  &  -19.5  dBm 

An  initial  value  of  one  was  used  for  std.  dev.  for  curve 
fit  and  for  mean  of  distance  a  value  where  packet  success 
equals  50%  was  used.  The  following  table  shows  the 
parameters  got  from  the  curve  fit  in  above  plots. 


Table  3:  Mean,  Std.  Dev.  and  Residual  Error  of  curves  fit 


Baud 

Rate 

(kBaud) 

Transmit 

Power 

(dBm) 

Curve  Parameters 
(Inches) 

Residual 

Error 

76.8 

-19.5 

Mean  = 

149.6819 

208.8363 

Std  Dev  = 

3.093 

-17.0 

Mean  = 

167.642 

78.0394 

Std  Dev  = 

4.567 

-  14.0 

Mean  = 

222.2506 

41.3694 

Std  Dev  = 

1.0526 

-  11.5 

Mean  = 

302.4448 

148.6817 

Std  Dev  = 

1.0754 

38.4 

-19.5 

Mean  = 

168.0964 

57.4137 

Std  Dev  = 

3.2833 

-17.0 

Mean  = 

191.759 

79.4251 

Std  Dev  = 

0.9316 

-  14.0 

Mean  = 

256.6509 

324.2136 

Std  Dev  = 

1.1637 

-  11.5 

Mean  = 

328.0941 

115.0494 

Std  Dev  = 

1.3191 

19.2 

-19.5 

Mean  = 

190.7525 

50.2595 

Std  Dev  = 

1.8992 

-17.0 

Mean  = 

239.1081 

11.9331 

Std  Dev  = 

2.0253 

-  14.0 

Mean  = 

292.7798 

80.6525 

Std  Dev  = 

1.3931 

-  11.5 

Mean  = 

353.1259 

35.7488 

Std  Dev  = 

0.9846 
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For  all  the  above  std.  dev.  of  distance  the 
average  =  1.899  in.  The  maximum  error  is  2.668  in.  This 
is  the  error  in  the  std.  dev.  of  Q  -  function  and  not  in  the 
transmission  distance.  In  transmission  distance  the  error 
will  be  2.668/2  =  1.334  in.  This  is  because  only  the 
distance  between  mean  and  monotonically  decreasing  part 
of  Q  -  function  is  included  while  measuring  transmission 
distance. 

The  mean  of  the  distance  in  Table  3  is  related  to  channel 
bandwidth  and  frequency  of  operation  apart  from  transmit 
power  and  baud  rate.  This  is  three  dimensional  and  we  use 
surface  fit  as  mentioned  below. 

A  very  important  result  derived  by  Shannon  [16]  is 


C=  Bc  xlog2 


1  + 


(JV.xBJ 


(4) 


Here  C  is  the  channel  capacity  in  bits  per  seconds,  Bc  is 


the  channel  bandwidth  in  Hz,  Pr  is  the  received  power  in 
watts  and  N0  is  the  single  -  sided  noise  power  density  in 


10  X  log,,, 

1 000  xN0x  Bc 

(  —  V 

2*'  -1 

l  ). 

watts  /  Hz.  Writing  in  terms  of Pr  and  converting  it  to 
dBm  we  get 

r  r  c  \  1 

(5) 

In  the  wireless  channel  the  received  power  is  calculated 
using  the  formula  [17]  below. 

Pr  (d \dBm ]  =  Pt [dBm )-  PL(d )[dB ]  (6) 

Here  />r(c/)is  the  received  power  in  dBm  at  a  distanced 
from  transmitter,/)  is  the  transmit  power  in  dBm 
and  PL{d )  is  the  path  loss  in  dB  at  a  distance  d  from 
transmitter.  For  finding  the  path  loss  .PZ(d)  we  use  the 
Log  -  Distance  Path  Loss  model  [17]  where 

(7) 

In  this  formula  Pl\dB\  is  the  path  loss  in  dB,  PL(d0 )  is 
the  path  loss  at  a  reference  distance  d0,  n  is  the  path  loss 
exponent,  d  is  distance  in  meters  at  which  the  path  loss  is 
to  be  calculated  and  d0  is  the  reference  distance  also  in 
meters. 

Substituting  eq.  (7)  in  (6)  and  the  result  in  (5)  we  get 
d  "  ‘  1000 *NnxB, 


f 

r  <01 

jl0x/7Xlog,0 

UJI 

P,[dBm\-  PL(d{))~  jlO  x  n  x  log,()^—  |  \  =  10  x  log,, 

Solving  for  d  we  have 


'  c  V 

P,-PL(J  oMOxlog.0 

lOOOxA r0*Bc 

2Bc  -1 

d  =  d0  x  1 0 


(8) 


Thus  we  relate  the  transmission  distance  with  transmit 
power,  baud  rate  and  channel  bandwidth.  We  use  this 
equation  in  the  surface  fit  using  method  of  least  squares. 

In  the  above  equation  Pl(d0)  is  calculated  using  the 


Friis  free  space  path  loss  formula  [17]  given  below. 
^40=-ioxiog1( 


(4«^)2  xrf, 


2 

0  . 


(9) 


Here  A  is  the  wavelength  andd0  is  the  reference  distance. 

Centre  frequency  of  the  motes  is  433.002  MHz,  as 
calculated  in  section  III,  from  which  we  get  A  and  the 
distance  d0  it  is  said  should  be  in  the  far  -  field  region  of 


the  transmitting  antenna.  This  far  -  field  region  or 
Fraunhofer  region  of  a  transmitting  antenna  is  defined  as 


the  region  beyond  the  far  -  field  distance  d f  which  is 

related  to  the  antenna  dimension  and  wavelength.  It  is 
given  as  follows  [17] 

2  xD2 


df=- 


(10) 


Where  d  f  is  the  far  -  field  distance,  D  is  the  largest 

physical  linear  dimension  of  the  antenna  and  A  is  the 
wavelength. 

The  Mica2  motes  have  a  monopole  whip  antenna  which 
is  At 4  meters  high.  Using  433.002  MHz  frequency  we  get 
A  =  0.6928  m  =  27.2767  in  and  the  whip  antenna 
height  =  0.1732  m  =  6.8  in.  So  D  -  0.1732  m.  Putting 
these  values  in  eq.  (10)  we  get^  =  0.0861 16  m  =  3.3903  in 

Additional  criteria  to  be  in  the  far  -  field  region  are  d f 
must  satisfy 

df))D  i.e.  df)) 6.8  in  and 
df))A  i.e.  rf, >>27.2767  in. 

So  to  be  in  the  far  -  field  region  we  choose  our  distance 
d0  to  be  100  in  =  2.54  m  and  from  the  A  value  got  above 

we  calculate  PL(d0)  and  use  it  in  eq.  (8).  The  N0  in 
eq.  (8)  is  calculated  using  N0  =  kxT  .  A*  is  the  Boltzmann 

constant  =  1.3806503x1 0‘23  m2  kg  s'2  K"1  and  T  is  the 
room  temperature  in  degree  kelvin  =  300°  k.  The  channel 
bandwidth  for  433  MHz  Mica2  mote,  Bc  =  620kHz  [18]. 

Using  all  these  values  in  eq.  (8)  we  try  to  find  the  best 
values  for  n  the  path  loss  exponent  such  that  the  error, 
between  the  distances  calculated  using  eq.  (8)  and  the 
values  of  distance  we  got  from  our  experimental  data,  is 
minimized.  To  start  with,  a  value  of  one  is  assumed  for  n. 
After  doing  the  surface  fit  we  get  the  following  graphs. 
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Plot  of  M«»n  of  Dtttance  at  a  funcbon  of  Baud  Rale  and  Tranimt  Power  using  Empirical  Data 


Plot  of  Mean  of  Distance  at  a  function  of  Baud  Rate  and  Transmit  Power  using  Surface  fit  Squared  2-Norm  of  the  Residual  *  6971  7456 

n  *  4.1045 


(b) 

Figure  4:  Mean  of  Distance  vs  Transmit  Power  and  Baud 
Rate  using  (a)  Empirical  Data  (b)  eq.  (8)  derived  from 
Shannon  theorem  and  Path  Loss  Model 

The  optimized  value  of  path  loss  exponent  n  got  from 
surface  fit  is  n  =  4.10.  The  following  graph  compares 
distances  using  both  data  and  also  shows  the  surface. 

Pi«  a  M«»r  ol  Distant .  as  a  kmclion  a  Baud  Rata  and  Trantmd  Po*tr  using  Surface  ft  Squared  2  Norm  of  the  Residua  *  6871  7458 


In  the  plot  above  the  white  bars  show  the  distances  got 
using  the  empirical  data  at  baud  rates  of  19.2,  38.4  and 
76.8  kBaud.  The  black  bars  show  the  distance  got  using 
eq.  (8).  So  the  difference  between  the  two  is  the  error.  For 
comparison  purpose  the  table  below  shows  the  empirical 
distance,  distance  obtained  using  eq.  (8),  the  absolute  error 
between  the  two  and  the  percentage  error. 


Table  4:  Summary  of  mean  of  distance  and  absolute  error 


Empirical  Data 

Transmit 

Baud  Rate 

Power 

19.2  kBaud 

38.4  kBaud 

78.6  kBaud 

-  19.5  dBm 

190.7526 

168.0964 

149.6819 

-17.0  dBm 

239.1081 

191.7589 

167.6420 

-14.0  dBm 

292.7798 

256.6509 

222.2506 

-11.5  dBm 

353.1259 

328.0941 

302.4448 

Shannon  Formula  Surface  Fit  -  n  =  4.10 

Transmit 

Baud  Rate 

Power 

19.2  kBaud 

38.4  kBaud 

78.6  kBaud 

-  19.5  dBm 

223.0249 

187.8755 

157.8458 

-17.0  dBm 

256.6032 

216.1618 

181.6109 

-14.0  dBm 

303.6356 

255.7817 

214.8980 

-11.5  dBm 

349.3505 

294.2918 

247.2528 

Absolute  Error 

Transmit 

Baud  Rate 

Power 

19.2  kBaud 

38.4  kBaud 

78.6  kBaud 

-  19.5  dBm 

32.2723 

19.7791 

8.1639 

-17.0  dBm 

17.4952 

24.4028 

13.9689 

-14.0  dBm 

10.8558 

0.8692 

7.3526 

-11.5  dBm 

3.7754 

33.8022 

55.1921 

Max  Error 

-  55.1921 

Min  Error  -  0.8692 

Percentage  Error 

Transmit 

Baud  Rate 

Power 

19.2  kBaud 

38.4  kBaud 

78.6  kBaud 

-  19.5  dBm 

16.9184 

11.7665 

5.4542 

-17.0  dBm 

7.3168 

12.7258 

8.3326 

-14.0  dBm 

3.7078 

0.3387 

3.3083 

-11.5  dBm 

1.0691 

10.3026 

18.2486 

Note:  All  distances  in  Inches. 


The  error  in  the  above  table  is  very  small.  A  close 
comparison  of  empirical  data  and  surface  fit  data  reveals  a 
mix  of  positive  and  negative  errors  in  mean  of  distance 
which  will  reflect  in  the  same  way  in  transmission 
distance.  When  this  equation  is  used  in  a  networking 
protocol  most  of  the  time  the  transmit  power  and  baud  rate 
set  will  maintain  a  good  link  to  the  receiver.  The  bad  link 
occurring,  in  spite  of  setting  the  transmit  power  and  baud 
rate  as  per  eq(8),  can  happen  also  because  of  the  dramatic 
change  in  the  environment  for  short  or  long  period  of  time. 
In  this  case  the  transmit  power  could  be  slightly  increased 
or  the  baud  rate  slightly  decreased  to  maintain  a  good  link. 
Once  done  these  new  values  can  be  used  throughout  the 
lifetime  of  the  network. 
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Tuning  of  the  transmission  power  and  baud  rate  will 
only  increase  the  network  setting  time  a  little  but 
afterwards  throughout  the  operation  of  the  network  energy 
will  be  conserved.  The  equation  in  this  paper  definitely 
gives  a  better  starting  point  at  fixing  the  transmit  power 
and  baud  rate  rather  than  approximately  varying  them 
from  minimum  to  maximum  and  wasting  a  lot  of  time  for 
network  setup. 

V.  Conclusion 

The  key  challenge  in  Wireless  Sensor  Networks  being 
power  conservation,  the  study  in  this  thesis  will  support 
multi  -  hop  routing  and  clustering  algorithms  in  increasing 
the  power  saving  in  each  sensor  node.  Using  Shannon 
theorem  and  Log  -  Distance  Path  Loss  Model  the 
transmission  distance  is  shown  to  be  related  to  transmit 
power  and  baud  rate.  The  extensive  empirical  data 
obtained  confirms  to  this  relation  of  transmission  distance. 
The  optimized  value  of  the  path  loss  exponent  obtained  as 
a  result  of  surface  fitting  is  close  to  that  obtained  in  other 
wireless  environments.  Using  this  relation  along  with  the 
knowledge  of  localization  and  traffic  density  between  the 
neighboring  mote  it  will  be  possible  to  adjust  the  transmit 
power  and  /  or  the  baud  rate  to  maintain  the  wireless  links 
between  the  neighboring  motes  without  loosing  the 
connection  and  without  incurring  huge  energy  costs. 

Apart  from  the  advantage  of  power  saving,  the  correct 
transmit  power  level  will  decrease  the  level  of  interference 
in  the  network  since  less  motes  in  the  surrounding  will 
hear  the  conversation.  There  is  also  an  increase  in  the 
network  capacity  as  the  packet  transmissions  are  confined 
only  to  the  small  local  area  and  other  motes  out  of  the  area 
are  free  to  carry  out  their  own  transmissions  with  some 
other  motes  within  their  area. 
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Abstract — In  this  paper,  we  study  the  problem  of  minimum 
connected  dominating  set  in  random  geometric  fc-disk  graphs. 
This  research  is  motivated  by  the  problem  of  virtual  backbone 
construction  in  wireless  ad  hoc  and  sensor  networks,  where 
the  coverage  area  of  nodes  are  disks  with  different  radii.  We 
derive  the  size  relationship  of  any  maximal  independent  set  and 
minimum  connected  dominating  set  in  geometric  A>disk  graphs, 
and  apply  it  to  analyze  the  performances  of  two  distributed 
connected  dominating  set  algorithms  we  propose  in  this  paper. 
These  algorithms  have  bounded  performance  ratio  and  low 
communication  overhead,  and  therefore  have  the  potential  to  be 
applied  in  real  ad  hoc  and  sensor  networks. 

Index  Terms — connected  dominating  set,  random  geometric  A> 
disk  graphs,  maximal  independent  set,  performance  ratio. 

I.  Introduction 

Let  V  be  the  set  of  points  denoting  the  set  of  nodes 
randomly  placed  in  the  Euclidean  plane.  Associated  with 
Vix  €  V  is  a  radius  ru  such  that  1  <  ru  <  /c,  where  k  is 
a  constant.  Let  Gk{V,E )  be  the  disk  graph  constructed  from 
V  such  that  an  edge  (u,v)  E  E  if  and  only  if  the  distance 
between  u  and  v  is  at  most  min{ru,  rv}.  We  call  G *  a  random 
geometric  k-disk  graph  or  a  geometric  k-disk  graph.  When 
k  =  1,  Gk  is  a  unit-disk  graph. 

Geometric  /c-disk  graphs  have  been  widely  adopted  to 
model  wireless  ad  hoc  and  sensor  networks,  in  which  two 
nodes  can  communicate  with  each  other  successfully  via  two- 
way  handshake  (DATA-ACK)  if  they  reside  in  each  other’s 
transmission  range.  Since  their  power  capacity  differs  or  for 
the  reason  of  interference  mitigation,  nodes  may  have  different 
transmission  ranges,  resulting  in  disks  with  different  radii. 

In  this  paper  we  study  the  problem  of  constructing  min¬ 
imum  connected  dominating  sets  (MCDS)  in  geometric  k- 
disk  graphs.  One  significant  application  of  this  problem  is  the 
virtual  backbone  construction  in  ad  hoc  and  sensor  networks, 
where  a  virtue  backbone  is  utilized  to  decrease  the  protocol 
overhead  [1],  [2].  Computing  a  MCDS  in  Gk{V,E)  is  a 
NP-hard  problem  since  its  special  case,  MCDS  in  unit-disk 


graphs,  is  NP-hard  [3].  In  this  paper,  we  first  figure  out  the 
size  relationship  between  any  maximal  independent  set  (MIS) 
and  the  minimum  connected  dominating  set  in  (7^.  Then  we 
report  two  distributed  approximation  algorithms  for  connected 
dominating  set  construction. 

Our  major  contributions  are  three  folds.  First,  we  prove 
that  the  maximum  number  of  independent  neighbors  a  node 
may  have  in  a  geometric  /c-disk  graph  is  at  most  5  + 
9  T  in(2  cos(  — ))  1  *  Second,  we  derive  the  size  relationship  be¬ 
tween  any  MIS  and  the  MCDS  in  geometric  /c-disk  graphs, 
which  is  (7rin(2‘cnos(f))l  +4)opt  +  2rrn-(21cn08fc(f))1+l,  where 
opt  is  the  size  of  any  MCDS.  Note  that  the' best  results  in 
literature  were  reported  by  Thai  et  al.  in  [4],  which  stated  that 
when  k  >  1,  each  node  has  at  most  10[^(2l”^s  ^  J  number 
of  independent  neighbors  and  the  size  of  any  MI§  is  at  most 
19[in(2cos z-)^opi’  Notice  that  our  results  work  for  unit-disk 
graphs  too,  where  k  =  1. 

The  third  contribution  of  this  paper  is  the  two  distributed 
approximation  algorithms,  for  which  we  conduct  rigorous 
theoretical  performance  analysis.  In  the  first  algorithm,  we 
grow  a  CDS  from  a  leader.  This  algorithm  requires  each  node 
know  the  ids  of  its  neighbors.  In  the  second  algorithm,  we 
construct  an  MIS  first  then  find  out  connectors  to  connect 
all  nodes  in  the  MIS.  This  algorithm  takes  node  degree  into 
consideration  and  therefore  has  higher  message  overhead  but 
achieves  better  performance  compared  to  the  first  one. 

This  paper  is  organized  as  follows.  We  first  introduce  the 
preliminary  definitions  in  Section  II.  Related  works  are  also 
surveyed  in  this  section.  Then  we  derive  the  size  relationship 
between  maximal  independent  set  and  minimum  connected 
dominating  set  in  geometric  A;-disk  graphs  in  Section  III. 
The  two  distributed  approximation  algorithms  are  reported  in 
sections  IV  and  V,  respectively.  We  conclude  this  paper  with 
a  brief  summary  and  a  brief  discussion  on  future  research  in 
Section  VI. 
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II.  Preliminaries  And  Related  Work 

A.  Preliminaries 

Given  any  graph  G  =  (L,  E),  two  vertices  are  independent 
if  they  are  not  neighbors.  For  V  u,v  €  V,  hop_count(u,v) 
is  the  number  of  edges  (hops)  in  the  shortest  path  from  u 
to  v.  For  any  vertex  v,  N\  [t>]  =  {u  |  hop_count(u,v)  < 
1}  is  the  one-hop  (close)  neighbor  set  of  v\  = 

{u  |  hop_count(u,v)  <  2}  is  the  two-hop  (close)  neighbor 
set  of  v. 

An  independent  neighbor  set  of  v ,  denoted  by  Ni(v)y  is 
a  subset  of  Ni[v]  such  that  any  pair  of  vertices  in  Nj(v)  are 
independent.  An  independent  set  S  of  G  is  a  subset  of  V  such 
that  for  V  u,v  G  5,  (u,  v)  £  E.  S  is  maximal  if  any  vertex 
not  in  S  has  a  neighbor  in  S. 

A  dominating  set  D  of  G  is  a  subset  of  V  such  that  each 
node  not  in  D  has  at  least  one  neighbor  in  D.  For  any  edge 
(ti,  v),  if  u  €  D  and  v  £  Dy  then  u  is  v's  dominator  and  v  is 
u's  dominatee.  If  both  u  and  v  are  €  D,  one  can  specify  the 
other  as  its  dominator.  An  optimal  dominating  set  has  mini¬ 
mum  cardinality.  If  the  induced  subgraph  of  D  is  connected, 
then  D  is  a  connected  dominating  set  (CDS).  Among  all 
CDSs  of  graph  G,  the  one  with  minimum  cardinality  is  called 
a  minimum  connected  dominating  set  (MCDS).  A  maximal 
independent  set  is  also  a  dominating  set. 

B.  Related  Work 

To  our  best  knowledge,  Thai  et  al.  [4]  was  the  first  and 
the  only  one  to  tackle  the  problem  of  MCDS  construction  in 
geometric  fc-disk  graphs.  In  that  work,  the  authors  proved  that 
that  any  node  can  have  at  most  10  L  ^(2  "os  g)  J  niimber  of  inde¬ 
pendent  neighbors  when  k  >  1.  This  result  is  problematic  as 
when  k  is  a  little  more  than  1,  it  yields  0,  which  is  impossible. 
Thai  et  al.  [4]  also  proposed  three  centralized  approximation 
algorithms  together  with  their  performance  analysis.  Our  work 
is  motivated  by  [4]  but  improves  [4].  In  addition,  we  focus  on 
the  design  of  distributed  algorithms  for  the  geometric  /c-disk 
graphs. 

There  exist  abundant  works  investigating  the  problem  of 
MCDS  construction  in  unit-disk  graphs  and  general  graphs. 
For  a  recent  literature  survey,  we  refer  the  readers  to  [  1  ]  and 
the  references  therein.  In  the  following  we  briefly  summarize 
several  major  works. 

The  NP-Completeness  of  MCDS  in  general  graphs  was 
studied  in  [5].  The  MCDS  remains  NP-hard  for  unit-disk 
graphs  [3].  In  1998,  Guha  and  Khuller  proposed  two  CDS  con¬ 
struction  strategies  in  their  seminar  work  [6].  These  two  greedy 
heuristic  algorithms  have  performance  ratios  2(1  +  H( A)), 
where  A  the  largest  node  degree  and  H  is  the  harmonic 
function,  and  3  +  In  A,  respectively.  Ruan  et  al.  [7]  proposed  a 
one-step  greedy  approximation  algorithm  with  a  performance 
ratio  of  2  +  In  A.  Wu  and  Li  [8]  proposed  the  first  localized 
algorithm  for  MCDS  in  general  graphs  but  their  algorithm 
does  not  have  a  bounded  performance  guarantee  [9]. 

For  unit-disk  graphs,  Wu  et  al.  [  10]  studied  the  size  relation¬ 
ship  between  MCDS  and  MIS  and  reported  that  the  size  of  any 
MIS  is  at  most  3.8  times  of  that  of  MCDS  plus  a  constant  1.2. 
With  this  result,  the  performance  ratio  for  the  approximation 


algorithms  in  [9],  [11],  [12],  which  connect  a  MIS  with  a 
spanning  tree,  is  improved  from  8  to  7.8.  Li  et  al.  [13]  designed 
the  algorithm  of  connecting  an  MIS  with  a  Steiner  tree,  and 
achieved  a  performance  ratio  of  5.8  +  In  4.  Note  that  all  the 
above-mentioned  works  require  a  MIS  satisfying  the  follwing 
property:  any  subset  of  the  MIS  is  two-hop  away  from  its 
complementary.  The  construction  of  such  a  MIS  relies  on  a 
spanning  tree.  The  works  by  Wan  et  al.  [14]  and  Cheng  et 
al.  [2]  removed  this  restriction  with  a  tradeoff  of  a  larger 
performance  ratio.  MCDS  in  unit-disk  graphs  has  a  polynomial 
time  approximation  scheme  [15],  which  means  that  MCDS  in 
unit-disk  graphs  can  be  approximated  to  any  degree. 

The  research  reported  in  this  paper  is  motivated  by  [2] 
and  [4]  and  we  target  the  geometric  /c-disk  graphs.  To  our 
knowledge,  the  results  reported  in  this  paper  are  the  best  in 
literature. 


III.  Maximal  Independent  Set  and  Minimum 
Connected  Dominating  Set  in  Geometric  /c-disk 
Graphs 

Given  Gk{V,  E)y  a  geometric  /c-disk  graph  as  defined  in 
Sec  I,  let  Nj(x)  be  the  set  of  independent  neighbors  of  x  for 
for  Vx  €  V.  We  have 

Lemma  3.1:  |jV/(x)|  ^  5  +  9rin(2cos(f))l- 

Proof:  Denote  by  rx  the  radius  of  x.  Let  a  be  a  real 
number  such  that  0  <  a  <  For  easier  elaboration  we 
assume  that  ^  is  a  constant  and  we  set  a  =  £.  We  will 
explain  at  the  end  of  the  proof  that  this  a  value  is  a  reasonable 
choice.  Note  that  the  whole  proof  procedure  only  requires 
0<Q<  f. 

We  first  draw  circles  centered  at  x  with  radii  (2  cos  a)0, 
2  cos  a,  (2  cos  a)2,  •  •  •,  (2cosa)Ua_1,  and  (2cosa)na,  respec¬ 
tively,  such  that  (2cosa)n-_1  <  rx  and  (2cosa)n-  >  rx. 
Therefore  ns  =  [n^osa)!-  By  this  way  the  disk  of  x 
is  partitioned  into  ns  annuluses  plus  the  unit-radius  circle 
centered  at  x,  as  shown  in  Fig.  1 .  We  claim  that  each  annulus 
contains  at  most  ~  -  1  nodes  in  Ni(x). 

Consider  any  annulus  7t,  as  shown  in  Fig.  1(a).  Let  u  and 
v  be  two  independent  neighbors  of  x  in  7 i.  We  must  have 
/uxv  >  a.  To  prove  this  claim,  we  draw  two  lines  xb  and  xd 
crossing  the  inner  and  outer  sides  of  H  at  a  and  6,  and  c  and  d, 
respectively,  such  that  u  resides  in  xb  and  Z6xd  =  a.  Since 
|§£  =  1§^|  =  2  cos  a,  we  have  |c6|  =  \ad\  =  |xa|  =  |xc|. 
Therefore  \yx\  >  \yb\  holds  true  for  any  point  y  in  the  closed 
area  abdc  of  7d,  which  means  all  nodes  in  the  closed  area  of 
abdc  in  7 i  are  neighbors  of  u.  Thus  v  must  be  out  of  the  area 
of  abdc ,  and  therefore  /.uxv  >  a.  Based  on  this  argument,  we 
conclude  that  there  exist  at  most  ~  —  1  number  of  independent 
neighbors  of  x  in  the  annulus  H. 

Since  there  are  ns  number  of  annuluses,  the  total  number 
of  independent  neighbors  of  x  in  all  the  annuluses  is  at  most 

».  ■  t¥  - 11  =  rnfeji  •  IS  - 1|.  u.  m  -  rnferi  ■ 

[77  —  !]•  When  0  <  a  <  /(a)  yields  a  minimum  when  a 

is  close  to  tL 

Now  consider  the  unit  disk  V  centered  at  x,  as  shown  in 
Fig.  1(b).  Let  u  and  v  be  two  independent  neighbors  of  x. 
Then  we  must  have  /uxv  >  60°,  since  otherwise,  |ui>|  < 
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max{|wx|,  |t^a:| }  <  1,  contradicting  the  independence  of  u  and 
v .  Therefore  there  exist  at  most  5  independent  neighbors  of  x 
at  V. 

Considering  the  unit-disk  area  and  all  the  annulus  areas,  we 
have  \Nl(x)\  <  5  +  9 [ ^(^cJsC f ))  1  -  5  +  9rin(2cos(f ))!•  m 


Fig.  1.  (a)  If  u  and  v  are  independent  neighbors  in  the  annulus  H ,  then 

Zuxv  >  q.  Note  that  only  part  of  TL  is  shown,  (b)  When  u  and  v  are 
independent  neighbors  in  the  unit-disk  area  of  x,  Zuxv  >  60°. 


In  the  following  analysis,  we  denote  by 

Now  we  have  derived  the  upper  bound  of  the  number  of 
independent  neighbors  for  any  node  x  in  Gk{V,E)>  which 
is  5  +  9 n*;.  In  the  next,  we  are  going  to  answer  the  following 
question:  how  many  independent  neighbors  a  pair  of  neigh¬ 
boring  nodes  may  have?  The  following  lemma  answers  this 
question  by  giving  an  upper  bound. 

Lemma  3.2:  Let  u  and  v  be  any  pair  of  neighboring  nodes 
in  Gk(V,E),  i.e.  u  e  V,  v  €  V,  and  (u,v)  €  E.  Then  the 
total  number  of  independent  neighbors  of  u  and  v  is  at  most 
9  +  16nfc. 

Proof:  Let  ru  and  rv  be  the  radii  of  u  and  vy  respectively. 
Without  loss  of  generality,  we  assume  ru  >  rv.  Assume  the 
disks  of  u  and  v  intersect  at  x  and  y ,  as  illustrated  in  Fig.  2. 
We  have  \ux\  =  ru  and  \vx\  =  rv.  Since  \ux\  >  \vx\  and 
\ux\  >  \uv\%  we  have  /.uvx  >  Similarly  we  have  /.uvy  > 
Therefore  /.xvy  >  Thus  the  area  in  v’s  disk  that  could 
cover  nodes  in  Nj(v)  but  not  in  Nj(u)  is  the  right  sector  xvy 
of  v  with  an  angle  at  most  . 

Now  we  can  take  the  same  procedure  as  that  in  Lemma  3. 1 
by  drawing  circles  centered  at  v  with  radii  1,  2  cos  a,  •  •  •.  Since 
the  angle  at  v  formed  by  any  two  independent  neighbors  of 
v  in  the  same  annulus  is  larger  than  a,  the  overlapping  area 
of  the  ri^ht  sector  xvy  and  all  the  annuluses  can  contain  at 
most  =  7rik  number  of  independent  neighbors  of 

v.  With  a  similar  argument,  the  intersection  of  the  unit-disk 
centered  at  v  and  the  right  sector  xvy  may  contain  at  most 
-jh  =  4  independent  neighbors.  Therefore  the  total  number  of 
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independent  neighbors  of  u  and  v  is  at  most  9  +  16nfc.  ■ 
Lemma  3.2  indicates  that  a  neighbor  can  bring  in  at  most 
4  -f  7nk  number  of  independent  nodes.  This  motivates  us  to 
prove  the  following  lemma  addressing  the  size  relationship  be¬ 


Fig.  2.  If  u  and  v  are  neighbors  in  Gk(V,  E)  and  ru  >  rv,  then  Z xvy  > 
3  * 


tween  a  minimum  connecting  dominating  set  and  any  maximal 
independent  set  of  the  same  Gfc. 

Let  S  C  V  be  any  MIS  of  Gk(V,E).  Denote  by  opt  the 
size  of  any  MCDS  in  G.  We  have 

Theorem  3.1:  |5|  <  (7n>t  +  4 )opt  -f  2 +  1. 

Proof:  Let  C  be  any  MCDS  of  Gk(V,E).  Compute  a 
spanning  tree  r  traversing  all  nodes  in  C.  Let  U\,U2,  •  •  •,  uopt 
be  the  preorder-tree-walk  of  r.  Then  from  Lemma  3.1  u\  will 
dominate  at  most  5  +  9 number  of  independent  nodes  in 
5.  Based  on  Lemma  3.2,  each  u*,  where  i  =  2,3, -  —  ,  opt, 
will  dominate  at  most  4  +  7 rik  number  of  independent  nodes 
in  5  that  are  not  dominated  by  any  node  in  Uj  where  j  <  i. 
Since  C  is  a  connected  dominating  set,  each  node  in  S  will 
be  either  in  C,  or  dominated  by  one  node  in  C.  Therefore 
I  S' |  <  5  +  9nfc  -f  (opt  —  1)(4  +  7rifc)  =  (4  +  7nk)opt  +  2rik  + 1. 

■ 

In  the  following  sections,  we  propose  two  distributed  CDS 
algorithms  and  analyze  their  performance  theoretically.  Note 
that  for  both  algorithms,  we  assume  that  each  node  has  a 
unique  id,  known  to  its  neighbors  within  two  hops  away. 

IV.  Algorithm  I:  Growing  A  CDS  from  A  Leader 

Algorithm  I  starts  from  a  leader  node.  For  simplicity,  we 
assume  that  the  node  with  the  smallest  id  is  the  leader.  We 
associate  a  color  with  each  node.  Strictly  speaking,  color  is 
not  a  parameter  in  our  algorithm.  It  is  retained  in  the  algorithm 
description  for  the  purpose  of  easier  elaboration. 

Initially  all  nodes  arc  colored  white.  In  the  first  step,  leader 
u  colors  itself  black  and  becomes  a  dominator.  Then  all  nodes 
in  N\  [u]  \  {u}  color  themselves  gray  and  become  dominatees, 
and  all  nodes  in  7V2[ti]  \  N\[u]  color  themselves  yellow  and 
become  active.  In  the  next  step,  neighboring  active  nodes 
compete  with  each  other  and  the  winners,  whose  ids  are  the 
smallest  among  their  yellow  neighbors,  become  dominators. 
Each  of  the  winners  also  specifies  its  own  dominator,  the  gray 
neighbor  whose  id  is  the  smallest  among  all  gray  neighbors. 
This  step  will  repeat  until  nodes  are  either  gray  (dominatees) 
or  black  (dominators).  All  black  nodes  form  a  CDS. 

Note  that  the  procedure  described  above  grows  a  CDS  from 
the  leader.  In  the  first  step,  only  the  leader  is  included  in  the 
CDS.  In  each  of  the  other  steps,  two  nodes  are  included  in  the 
CDS,  with  one  turning  color  from  yellow  and  one  from  grey. 

Theorem  4.1:  The  size  of  the  CDS  generated  by  Algorithm 
I  is  at  most  (14n^  -h  8) opt  -f  4rik  +  1,  where  opt  is  the  size 
of  the  minimum  connected  dominating  set. 
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Proof.  We  can  partition  all  the  dominators  into  two  sets:  A 
and  B.  Set  A  contains  all  vertices  with  color  changing  from 
white  to  black  directly  and  B  contains  all  vertices  with  color 
changing  from  white  to  gray  then  to  black.  The  first  step  adds 
the  leader  to  A.  Each  of  the  other  steps  adds  one  node  to  A 
and  one  node  to  B.  Thus  \A\  =  \B\  +  1. 

Now  we  claim  that  A  is  an  independent  set.  This  is 
obvious  since  each  vertex  u  in  A  is  colored  black  from 
white  or  yellow.  This  means  u  has  no  black  neighbors 
because  each  neighbor  of  a  black  node  has  gray  color. 
From  Lemma  3.1,  \A\  <  (7n^  +  4 )opt  +  2 +  1.  Thus 
|j4|  +  \B\  <  (14 nk  +  8 )opt  +  4 nk  +  1.  ■ 


Fig.  3.  The  state  transition  diagram  of  Algorithm  I  for  any  node  u. 


Remarks:  (i)  Algorithm  I  grows  a  tree  from  the  leader  in  a 
step-by-step  fashion.  At  any  time,  all  the  inner  nodes  of  the 
tree  are  colored  black  while  all  the  leaf  nodes  are  colored  gray. 
In  the  first  step,  the  leader  and  all  of  its  neighbors  are  added 
to  the  tree.  In  every  other  step,  a  leaf  node  v  and  one  of  its 
yellow  neighbors  u  are  colored  black  (added  to  the  tree).  All 
the  white/yellow  neighbors  of  u  and  v  are  colored  gray  and 
added  to  the  tree  as  leaves.  This  algorithm  terminates  when  no 
white/yellow  node  left.  All  the  black  nodes  form  the  CDS.  (ii) 
If  we  use  node  cost  instead  of  id  as  the  criteria  for  dominator 
selection.  Algorithm  I  is  cost-aware.  For  example,  if  the  cost 
is  the  inverse  of  the  residual  power  of  each  node,  the  output 
CDS  has  higher  power  capacity;  if  the  cost  is  the  incoming 
bitrate  (load),  the  output  CDS  has  lower  load;  if  the  cost  is  the 
inverse  of  the  node  velocity,  the  induced  graph  by  the  output 
CDS  has  a  more  stable  topology. 

A.  Distributed  Implementation 

Each  node  u  maintains  the  following  parameters:  domu> 
which  is  the  dominator,  or  the  parent  of  the  node  in  the 
tree;  ranku,  which  defines  a  relative  relationship  among 
neighboring  nodes1;  childrenUy  which  contains  all  dominated 
nodes,  or  the  children  of  the  node  in  the  tree.  These  parameters 
are  updated  by  the  exchange  of  the  following  3  messages. 

<  dominator (u,  domu,ranku)  >  —  node  uy  whose  dom¬ 
inator  is  domu  and  whose  rank  is  ranku,  broadcasts  this 
message  to  all  neighbors,  u  is  a  dominator. 

<  dominatee(u,  domu,  ranku)  >  —  node  u ,  whose  dom¬ 
inator  is  domu  and  whose  rank  is  ranku ,  broadcasts  this 
message  to  all  neighbors,  u  is  a  dominatee. 

<  active(u)  >  —  node  u  broadcasts  this  message  to  all  of 
its  white/yellow  neighbors  when  it  becomes  active.  A  white 
node  becomes  active  after  it  receives  the  first  <  dominatee  > 
message  from  one  of  its  neighbors. 

The  state  transition  diagram  of  Algorithm  I  is  given  in 
Figure  3.  Each  node  u  runs  a  copy  of  the  algorithm.  At  any 
time,  u  can  be  in  one  of  the  4  states:  So,  Si,  S2,  and  S3.  The 
directed  arc  from  Si  to  Sjy  where  i,j  =  0, 1,2,3,  represents 
the  transition  from  state  S*  to  state  Sj.  Each  transition  is 
labeled  by  a  number.  These  transitions  will  be  explained  latter. 

1  In  Algorithm  I,  rank  is  the  level  of  the  node  in  the  tree 


State  So  is  the  initial  state.  A  node  in  this  state  has  white 
color.  All  nodes  are  in  So  at  the  beginning  of  the  algorithm. 
State  Si  is  the  dominatee  state.  A  node  in  this  state  is  a 
dominatee  and  has  gray  color.  State  S2  is  the  active  state. 
A  node  in  this  state  has  at  least  one  neighbor  in  Si  and  has 
yellow  color.  An  active  node  is  a  candidate  dominator  in  next 
step.  State  S3  is  the  dominator  state.  A  node  in  this  state 
is  a  dominator  and  has  black  color.  All  nodes  in  S3  form 
the  connected  dominating  set.  Each  node  u  in  state  So  or  S2 
also  maintain  a  parameter  Wu  to  node  all  yellow  neighbors. 
Initially  Wu  =  0.  If  u  has  the  smallest  id  compared  to  its 
yellow  neighbors  in  WUy  u  will  become  a  dominator  in  the 
next  step.  The  transition  steps  are  detailed  below. 

1)  u  is  in  state  So-  If  u  is  the  leader,  then  domu  = 

uy  ranku  =  0.  u  will  broadcast  message  < 

dominator (u,  u,  0)  >  and  go  to  state  S3. 

2)  u  is  in  state  So  or  S2  and  receives  message  < 

dominator(v,d,l)  >  from  neighbor  v.  If  d  is  a  neigh¬ 
bor  of  uy  then  domu  =  dy  ranku  =  Z;  otherwise, 
domu  =  vy  ranku  =  l  +  1.  u  will  broadcast  message 

<  dominatee(u,domu, ranku)  >  and  go  to  Si. 

3)  u  is  in  state  So  and  receives  message 

<  dominatee(v,d,l)  >  from  neighbor  v.  u  will 
broadcast  message  <  active{u)  >  and  go  to  state  S2. 
u  temporarily  sets  domu  to  v  and  ranku  to  l  +  1.  If  v 
is  in  WUy  it  will  be  removed. 

4)  u  is  in  state  S2  and  receives  message 

<  dominatee(y,d,l)  >.  Remove  v  from  Wu  if 

it  is  in  If  idv  <  iddomu>  then  domu  =  v  and 
ranku  =  Z  - hi.  u  will  go  back  to  S2. 

5)  u  is  in  state  So  or  S2  and  receives  message  < 

active(v)  >.  Wu  =  W^U^}.  u  remains  in  the  original 
state. 

6)  u  is  in  state  S2  and  there  is  no  broadcasting  in  Ni[u] 
for  To  time  unit  (which  is  a  design  constant).  If  u  has 
smallest  id  compared  with  all  nodes  in  Wu ,  it  will 
broadcast  message  <  dominator (tz,  domu ,ranku)  > 
and  go  to  S3. 

7)  u  is  in  state  Si  and  receives  message 

<  dominator (Vydyl)  >  from  v.  If  d  =  uy  u  will 
broadcast  message  <  dominator (u,domu,ranku)  > 
and  go  to  state  S3. 

8)  u  is  in  state  S3.  If  u  finds  that  none  of  its  neighbors 


5 


takes  u  as  the  dominator,  u  will  broadcast  message 
dominatee(u,domu,ranku)  and  go  to  state  S\. 

Remarks:  (i)  The  leader  can  be  elected  by  the  distributed 
leader  election  algorithm  proposed  in  [16].  (ii)  The  parameter 
To  is  used  to  force  the  start  of  next  step  in  nearby  environment 
be  strictly  after  the  end  of  the  current  step.  To  is  a  design 
parameter,  which  could  be  predetermined  via  simulation,  (iii) 
We  assume  the  message  broadcast  is  reliable.  For  reliable 
broadcasting,  we  refer  the  readers  to  [17],  [18].  (iv)  When  u  re¬ 
ceives  a  message  <  dominator  (v,  u ,  l)  or  dominatee(v,  u,  l), 
u  will  put  v  into  its  children  list  if  v  is  not  there,  (v)  In  state 
Si,  a  gray  node  will  select  the  lowest  rank  black  node  as 
its  dominator.  This  can  help  to  optimize  the  generated  CDS. 
For  example,  if  a  black  node  us  dominated  children  are  also 
dominated  by  dom(u),  then  u  can  become  a  dominatee  and 
go  to  state  S\.  (vi)  Step  8  is  an  optimization  to  remove  those 
dominators  with  no  dominatees. 

Theorem  4.2:  Algorithm  I  has  a  message  complexity  O(n), 
where  n  is  the  total  number  of  vertices. 

Proof.  We  have  3  types  of  messages:  <  dominator  >, 
<  dominatee  >,  and  <  active  >.  Each  node  broadcasts 
each  message  at  most  once.  Thus  the  total  number  of 
broadcastings  is  at  most  0(n).  ■ 

Note  that  the  above  message  complexity  analysis  does  not 
consider  the  leader  election,  whose  message  complexity  is 
D(nlogn)  [14]. 

B.  An  example 

Now  we  use  an  example  to  demonstrate  how  to  apply 
Algorithm  I  to  compute  a  connected  dominating  set.  The  given 
geometric  disk  graph  G  =  (V,E)  is  shown  in  Figure  4.  There 
are  6  nodes  and  1 1  links.  Host  0  is  the  leader. 


Fig.  4.  An  example.  Unit-disk  graph  G  contains  9  nodes  and  12  links. 

For  this  example,  3  steps  are  needed  to  compute  the 
connected  dominated  set.  They  are  described  in  detail  below. 
0.  Initially  all  nodes  are  in  state  So. 

1.  Host  0  broadcasts  message  <  dominator( 0,0,0)  > 
and  goes  to  state  S3.  After  receiving  < 

dominator( 0,0,0)  >,  nodes  1  and  5  broadcast  messages 

<  dominatee(  1,0,1)  >  and  <  dominatee( 5,0,1)  >, 
respectively,  and  go  to  state  S\.  After  receiving 

<  dominatee (1,0, 1)  >  and/or  <  dominatee( 5, 0, 1)  >, 


nodes  2,  4,  and  6  broadcast  messages  <  active( 2)  >, 

<  active( 4)  >,  and  <  active(6)  >,  respectively,  and  go 
to  state  S2.  After  this  step,  the  tree  contains  inner  node 
{0}  and  leaves  {1,5}. 

2.  After  the  first  step,  W2,  We ,  WA  contain  {6},  {2,4}, 
and  {6},  respectively.  Therefore  node  2  and  4  will 
become  dominators  since  they  have  the  smallest  id 
compared  with  their  yellow  neighbors  in  W2  and  VV4, 
respectively.  Thus  nodes  2  and  4  broadcast  message 

<  dominator (2, 1,2)  >  and  <  dominator (4, 5, 2)  >, 
respectively,  and  go  to  state  S3.  Here  node  2  selects 
node  1  as  its  dominator  because  node  1  has  a 
smaller  id  compared  with  node  5.  After  receiving 

<  dominator ( 2,1,2)  >,  node  1  broadcasts  message 

<  dominator (1,0, 1)  >  and  goes  to  state  S3;  similarly 

node  5  broadcasts  message  <  dominator ( 5,0, 1)  >  and 
goes  to  state  S3.  After  receiving  <  dominator (2, 1,  2)  > 
and/or  <  dominator( 4, 5, 2)  >  and/or 

<  dominator  (b,0,\)  >,  nodes  3  and  6 

broadcast  messages  <  dominatee(  3,2,3)  >  and 

<  dominatee( 6,2,3)  >  respectively,  and  go  to  state 
Si.  After  this  step,  the  tree  contains  inner  nodes 
{0,1, 2, 4, 5}  and  leaves  {3,6}. 

3.  Node  4  has  no  dominatee  and  therefor  go  to  state  Si  to 
become  a  dominatee.  The  final  tree  contains  inner  nodes 
{0, 1,2,5}  and  leaves  {3,4,6},  as  shown  in  Fig.  5. 


Fig.  5.  The  computed  connected  dominating  set  contains  nodes  {0, 1,2, 5}. 
The  optimal  solution  contains  {5,6}. 


V.  Algorithm  II:  Connecting  A  Maximal 
Independent  Set 

Algorithm  I  could  be  improved  if  node  degree  is  taken  into 
consideration.  In  Algorithm  II,  instead  of  growing  a  CDS  from 
a  leader,  we  compute  a  MIS  first,  then  connect  all  nodes  in 
the  MIS.  Accordingly  Algorithm  II  contains  two  phases. 

In  Algorithm  II,  each  node  needs  to  maintain  two  more 
parameters:  the  effective  degree  d *,  which  is  defined  to  be  the 
number  of  white/yellow  neighbors,  and  the  black  degree  d*b, 
which  is  used  by  a  gray  node  to  record  the  number  of  black 
neighbors  with  a  higher  rank.  The  distributed  implementation 
of  algorithm  II  is  detailed  in  the  following  subsection. 


6 


A.  Distributed  Implementation 

The  state  transition  diagram  for  the  first  phase  is  shown  in 
Figure  6. 


Fig.  6.  The  stale  transition  diagram  [or  the  first  phase  oj  Algorithm  II  for 
any  node. 


Initially  all  nodes  are  colored  white.  During  the  execution 
of  phase  1,  each  white/yellow  node  keeps  track  of  its  effective 
degree  d*,  which  will  be  updated  when  a  neighbor  changes 
color.  To  keep  the  degree  information  up-to-date,  we  need 
a  new  message  <  degree(u,d*)  >,  by  which  u  tells  its 
neighbors  that  its  effective  degree  is  d* .  We  also  need  the 
message  <  blackdegree(u,d*b)  >,  which  is  broadcasted  by  u, 
a  dominatee.  The  “rank”  information  will  be  used  in  phase  2 
to  connect  all  members  in  the  generated  MIS.  The  transitions 
are  explained  below.  Initially  d*  =  d ,  where  d  is  the  degree 
whose  value  is  |iVi[it]|  —  1. 

1)  u  is  the  leader  and  is  in  state  S0.  u  will  broadcast 
message  <  dominator  [u,u,  0)  >  to  its  neighbors  and 
go  to  state  S3. 

2)  u  is  in  state  So  or  S2  and  receives 

<  dominator(v,x,l)  >  from  neighbor  v.  Here 
x  is  either  the  leader  or  null,  u  will  broadcast 

<  dominatee(u,v,l)  >  and  go  to  S\. 

3)  u  is  in  state  So  and  receives  the  message  < 
dominatee{v,d,l)  >.  u  will  update  d*  to  d*  —  1, 
removes  v  from  Wu  if  v  G  Wu,  set  ranku  to  /  +  1, 
broadcast  <  active(u )  >,  and  then  go  to  S2. 

4)  In  S2,  u  will  keep  track  of  all  <  dominatee(y,  d,  l)  > 
messages  broadcasted  in  N\[u\  and  remove  those  dom- 
inatees  in  Wu.  u  updates  d*  and  ranku  accordingly 
( ranku  =  1  +  maxv{l}).  If  there  is  no  broadcasting 
in  Ni[u]  for  T0  time  unit,  u  will  broadcast  message 

<  degree(u,d*)  >  if  d*  is  changed  after  last  broad¬ 
casting  of  <  degree  >  message. 

5)  u  is  in  state  So  or  S2.  If  u  receives  message  < 
active(v)  >,  then  Wu  =  Wu  U  {v};  if  u  receives 
message  <  degree(v,d*)  >,  u  updates  the  local  record 
of  v's  effective  degree. 

6)  u  is  in  state  S2  and  there  is  no  broadcasting  in 
N\[u]  for  To  time  unit.  If  u  has  biggest  (d*  ,id) 
compared  with  all  nodes  in  Wu,  u  will  broadcast  < 
dominator (u,  null,  ranku)  >  and  go  to  state  S3. 


7)  u  is  in  state  Si .  u  keeps  track  of  the  number  of  higher 
rank  black  neighbors  (number  of  higher  rank  dominators 
in  Ni[u],  denoted  by  db).  When  all  of  its  neighbors 
are  either  in  Si  or  in  S3,  u  broadcasts  the  message 
<  blackdegree(u,d*h)  >. 

8)  u  is  in  state  S3,  u  keeps  track  of  the  gray  neighbor  with  a 
lower  rank  whose  (d£ ,  id)  is  the  biggest,  in  lexicographic 
order.  This  is  u's  candidate  dominator  in  phase  2. 

Remarks:  (i)  Phase  1  generates  many  stars.  Each  star  consists 
of  one  black  node,  which  serves  as  the  center,  and  many  gray 
nodes  dominated  by  the  center.  Each  gray  node  in  a  star  sets 
its  dom  to  the  center.  All  nodes  in  a  star  have  the  same  rank, 
which  is  also  the  rank  of  the  star.  The  dominator  of  a  gray 
node  is  its  first  black  neighbor.  In  other  words,  a  gray  node 
u  resides  in  the  star  centered  at  the  black  neighbor  which 
is  the  first  to  go  to  state  S3.  (ii)  The  star  centered  at  the 
leader  has  rank  0.  The  rank  of  any  other  star  s  centered  at 
u  is  one  plus  the  highest  rank  of  u' s  gray  neighbors  not 
in  s.  (iii)  We  can  also  understand  phase  1  in  the  following 
way:  phase  1  contains  multiple  steps.  Each  step  generates  a 
star.  At  each  step  except  the  first  one,  an  active  node  (in  S2) 
u  with  maximum  effective  degree  compared  with  all  of  its 
active  yellow  neighbors  is  colored  black.  All  of  u' s  1-hop 
white/yellow  neighbors  (in  state  S\  or  S2)  are  colored  gray. 
All  of  u's  2-hop  white  neighbors  (in  state  So)  will  go  to  state 
S2.  (iv)  <  degree  >  message  is  used  to  announce  the  number 
of  white/yellow  neighbors  of  node  u  to  all  of  us  white/yellow 
neighbors.  With  this  information,  a  dominator  will  be  elected 
in  next  step.  <  blackdegree  >  message  is  used  by  gray  node 
u  to  announce  the  number  of  black  neighbors  whose  rank  is 
higher  than  u  to  all  higher  rank  black  neighbors  of  u.  With 
this  information,  a  black  node  v  can  select  a  gray  neighbor 
with  a  lower  rank  than  v  that  connects  to  many  stars. 

After  phase  1,  each  node  is  either  in  state  Si  or  state  S3. 
A  node  in  state  Si  has  a  dominator  in  S3  while  a  node  in 
S3  does  not  have  any  dominator  (except  the  leader).  Phase  2 
will  designate  a  dominator  for  each  black  node  u  (in  S3),  us 
dominator  will  be  the  gray  node  v  in  Si  such  that  rank(v)  < 
rank(u)  and  v  has  the  largest  ( dl,id )  among  all  lower-ranked 
gray  neighbors  of  u.  The  transition  diagram  for  phase  2  is 
shown  in  Figure  7.  We  elaborate  the  details  below. 
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Fig.  7.  The  state  transition  diagram  for  the  second  phase  of  Algorithm  II 
for  each  node. 


1)  After  a  black  node  u  (in  S3)  received  <  blackdegree  > 
from  all  of  its  lower-ranked  gray  neighbors,  it  broadcasts 
message  <  dominator(u,domu,ranku)  >,  where 
domu  is  the  lower-ranked  gray  neighbor  with  highest 
(1 d>l,id ).  Here  ranku  was  set  in  phase  1. 

2)  After  a  gray  node  u  (in  Si)  received 
<  dominator (v,u,rv)  >  from  vy  it  will  broadcast 
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<  dominator(u,domu,ranku)  >  and  go  to  state  S3. 
Here  the  domu  and  ranku  parameters  for  u  were  set 
in  phase  1. 

3)  A  gray  node  u  in  S\  will  keep  track  of  the  black 
neighbor  v  with  the  lowest  rank  by  listening  to  all  < 
dominator  >  broadcastings.  After  all  black  neighbors 
(generated  in  phase  1)  determine  their  dominators,  u  will 
broadcast  message  <  dominatee(u,v,rankv)  >  if  v  is 
different  than  its  original  dominator  domu.  Note  that 
here  u  will  be  a  dominatee  after  phase  2. 

4)  u  is  in  state  S3  but  no  gray  node  selects  u 
as  the  dominator,  u  will  broadcast  message  < 
dominatee(u,domu,ranku)  >  and  go  to  state  S\. 

Remarks:  (i)  Phase  2  assigns  a  dominator  v  to  the  center  u  of 
each  star  generated  in  phase  1  except  the  star  centered  at  the 
leader,  v  is  a  gray  node  satisfying  the  following  conditions:  v 
is  located  in  a  star  with  a  rank  lower  than  u  and  v  is  adjacent  to 
maximum  number  of  stars  with  a  higher  rank  than  v.  (ii)  After 
phase  2,  the  dominator  of  a  gray  node  u  is  always  the  black 
neighbor  with  the  lowest  rank,  (iii)  Phase  2  is  a  local  process 
which  happens  among  adjacent  stars  with  different  ranks.  After 
a  black  node  u  received  <  blackdegree  >  from  all  of  its 
lower-ranked  gray  neighbors,  it  can  select  its  dominator  and 
broadcast  message  <  dominator  (u,domu,ranku)  >.  This 
means  a  black  node  can  initiate  phase  2  immediately  after  all 
necessary  information  is  available.  There  is  no  explicit  start 
time  for  phase  2.  Phase  2  is  used  to  connect  all  stars. 

Remarks:  (i)  There  are  many  similarities  between  Algorithm 
I  and  II.  Both  algorithms  grow  a  maximal  independent  set 
(MIS)  from  the  leader  (see  Theorem  4.1  and  Lemma  5.1).  (ii) 
The  major  differences  are:  (a)  The  criteria  to  decide  whether 
an  active  node  can  be  elected  to  be  a  member  of  the  MIS 
is  different.  In  Algorithm  I,  an  active  node  with  a  smallest  id 
compared  with  all  of  its  active  neighbors  will  be  added  into  the 
MIS.  In  Algorithm  II,  an  active  node  with  maximum  effective 
degree  compared  with  all  of  its  active  neighbors  will  be  added 
into  the  MIS.  (b)  The  time  when  the  dominators  of  the  nodes 
in  the  MIS  are  determined  is  different.  Algorithm  I  assigns  a 
dominator  to  each  node  in  the  MIS  when  the  node  satisfies 
the  condition  in  (a);  Algorithm  II  delays  the  determination  of 
the  dominator  to  phase  2.  Thus  Algorithm  II  can  pick  the  gray 
node  that  connects  to  a  larger  number  of  elements  in  MIS. 

B.  An  example 

Now  we  show  how  to  apply  Algorithm  II  to  the  example 
in  subsection  IV-B.  Initially  all  nodes  are  in  state  So.  Phase  1 
contains  2  steps: 

1)  Host  0  broadcasts  message  <  dominator( 0,0,0)  > 
and  goes  to  state  S3.  After  receiving 

<  dominator  (0,0,0)  >,  nodes  1  and  5 

broadcast  messages  <  dominatee(  1,0,0)  >  and 

<  dominatee( 5,0,0)  >,  respectively,  and  go  to 

state  S\.  After  receiving  <  dominatee(  1,0,0)  > 
and/or  <  dominatee( 5,0,0)  >,  nodes  2,  4,  and  6 
broadcast  messages  <  active( 2)  >,  <  active( 4)  >,  and 

<  active(6)  >,  respectively,  and  go  to  state  S2.  After 


all  these  broadcastings,  nodes  2,  4,  and  6  broadcast 
messages  <  degree( 2,2)  >,  <  degree( 4,1)  >,  and 

<  degree( 6, 3)  >,  respectively.  This  step  generates  the 
star  (0,  {1, 5}). 

2)  Nodes  2,  4,  6  maintain  the  list  of  active  neighbors 
and  the  corresponding  effective  degrees,  which  are 
{(3,6)},  {(2,2),  (1,4)},  and  {(3,6)},  respectively.  The 
winner  of  the  competition  is  node  6  since  it  has  the 
highest  ( d*,id ),  thus  node  6  broadcasts  message  < 
dominator (6,  null,  1)  >  and  go  to  state  S3.  After 
receiving  <  dominator (6,  null,  1)  >,  nodes  2,  3, 
and  4  broadcast  messages  <  dominatee( 2,6,1)  >, 

<  dominatee( 3,6,1)  >,  and  <  dominatee( 4,6,1  >, 
respectively,  and  go  to  state  Si.  Then  all  nodes  1,  2,  3, 

4,  and  5  broadcast  messages  <  blackdegree( 5,1)  >, 

<  blackdegree(\,  1)  >,  <  black degree(2,0)  >,  < 
blackdegree(3,0)  >,  and  <  blackdegr  ee(4,0)  >, 
respectively.  This  step  generates  the  star  (6,  {2,3,4}). 

After  phase  1,  nodes  {0,6}  are  in  state  S3  and  all  other 
nodes  are  in  state  Si.  Phase  2  contains  the  following  step: 

3.  After  receiving  <  blackdegree  >  message  from  node 

5,  the  only  lower- ranked  gray  neighbor,  node  6  broad¬ 
casts  message  <  dominator (6,5, 1)  >.  After  receiving 

<  dominator (6, 5,1)  >,  node  5  broadcasts  message 

<  dominator (5, 0,0)  >  and  goes  to  state  S3.  After 
receiving  <  dominator (5, 0,0)  >,  nodes  2  and  4 
will  selects  node  5  as  its  dominator.  The  algorithm 
terminates.  The  resultant  CDS  contains  nodes  {0,5,6}. 

C.  Performance  analysis 

In  this  subsection,  we  study  the  performance  of  Algorithm 
II. 

Lemma  5.1:  After  phase  1,  all  nodes  in  state  S3  form  a 
MIS.  In  other  words,  the  centers  of  all  stars  form  a  MIS. 

Proof.  A  node  is  colored  black  only  from  white.  No  two 
white  neighboring  nodes  could  be  colored  black  at  the  same 
time  since  they  must  have  different  (d* ,  id).  Whenever  a  node 
is  colored  black,  all  of  its  neighbors  are  colored  gray.  Once  a 
node  is  colored  gray,  it  remains  in  color  gray  during  Phase  1 .  ■ 

Lemma  5.2:  Phase  2  generates  a  connected  dominating  set. 

Proof.  Note  that  phase  1  generates  many  stars  whose 
centers  form  a  MIS.  Two  stars  are  adjacent  if  a  gray  node  in 
one  star  is  a  neighbor  of  the  center  of  the  other  star.  Since 
the  input  graph  is  connected,  a  star  except  the  lowest-ranked 
one  (the  one  with  the  leader)  centered  at  the  leader  has  at 
least  one  adjacent  star.  Each  star  has  a  rank  which  is  higher 
than  the  rank  of  its  adjacent  stars.  Phase  2  connects  a  star  to 
a  lower-ranked  adjacent  star  by  turning  a  gray  vertex  in  the 
lower-ranked  star  to  black.  Thus  after  phase  2,  each  black 
vertex  (the  center  of  a  star)  has  a  path  consisting  of  only 
black  nodes  to  the  leader.  Therefore  all  these  black  vertices 
form  a  connected  dominating  set.  ■ 

Lemma  5.3:  In  phase  2,  the  gray  node  with  maximum 
(d*b,id)  will  connect  to  d*b  number  of  lower-ranked  stars. 


Proof.  Let  u  be  the  gray  node  with  maximum  (d^id). 
Phase  1  ensures  that  u  belongs  to  the  star  whose  center  is  the 
first  black  neighbor  of  u.  For  all  other  black  neighbors  (centers 
of  some  stars)  of  u ,  their  ranks  must  be  greater  than  that  of  u 
since  a  black  node  in  phase  1  always  assigns  its  rank  to  be  one 
plus  the  rank  of  its  highest  rank  dominatee  neighbor.  Thus  all 
these  d*h  nodes  will  select  u  as  their  dominators  in  phase  2.  In 
other  words,  u  connects  to  d*b  number  of  lower-ranked  stars.  ■ 

Theorem  5.1 :  The  connected  dominating  set  generated  in 
Algorithm  II  has  a  size  at  most  (14 rik  +  S)opt  +  4 nk  -  1, 
where  opt  is  the  size  of  any  optimal  MCDS  for  the  given 
instance. 

Proof.  From  Lemma  5.1,  phase  1  computes  a  MIS.  Let  A 
be  this  MIS  with  a  size  \A\.  From  Theorem  3.1,  \A\  <  (7 n^F 
4 )opt  F  2 rik  F  1.  Note  that  in  phase  2,  at  most  \A\  -  1  nodes 
in  state  S\  will  go  to  state  S3.  Now  we  consider  two  cases 
here. 

First,  if  there  exist  a  gray  vertice  with  d*b  >  2  at  the 
beginning  of  phase  2,  from  Lemma  5.3,  the  gray  vertex  u  with 
maximum  ( d*b,id )  will  connect  d*b  stars  to  the  higher  rank  star 
u  resides  in  phase  2.  Therefor  the  number  of  nodes  changing 
state  from  S\  to  S3  in  phase  2  is  at  most  \A\  -  2.  Thus  the 
total  number  of  nodes  in  state  S3  is  at  most  \A\  F  \A\  -  2  < 
(7 rik  +  4 )opt  +  2nk  +  1  +  (7 rik  +  4 )opt  +  2 rik  F  1  —  2  < 
(14 nk  -f  8) opt  -F  Ank  -  1. 

Secondly,  if  all  gray  vertices  have  d*b  <  1  at  the  beginning 
of  phase  2,  then  the  number  of  nodes  changing  state  from 
S\  to  S3  in  phase  2  is  exactly  \A\  -  1.  Since  the  dominator 
of  any  gray  vertex  u  is  its  first  black  neighbor,  and  all  other 
black  neighbors  of  u  have  higher  rank  than  u,  thus  the  total 
number  of  black  neighbors  u  has  is  at  most  2.  In  other 
words,  any  node  in  an  optimal  MCDS  is  either  in  A  or 
adjacent  to  at  most  2  vertices  in  A  and  any  vertex  in  A  is 
dominated  by  a  vertex  in  the  MCDS.  Therefor  in  this  case, 
\A\  <  2  •  opt.  Thus  the  total  number  of  black  nodes  will  be 
2  •  opt  -F  2  •  opt  —  1  <  4  •  opt.  ■ 

Theorem  5.2:  Algorithm  II  has  message  complexity 
O(nA),  where  n  is  the  total  number  of  vertices  and  A  is  the 
maximum  node  degree. 

Proof.  In  Algorithm  II,  the  message  complexity  is 
dominated  by  the  <  degree  >  messages  broadcasted  by 
white  vertices  in  52  in  phase  1  since  each  node  broadcasts 
each  of  the  other  messages  at  most  twice.  Therefore  the 
message  complexity  is  O(nA).  ■ 


VI.  Conclusion 

In  this  paper,  we  studied  the  problem  of  constructing  min¬ 
imum  connected  dominating  set  in  geometric  A;-disk  graphs. 
We  first  derived  the  upper  bound  of  the  size  of  any  maximal 
independent  set  compared  to  that  of  a  MCDS.  Then  we  pro¬ 
posed  two  distributed  approximation  algorithms  and  studied 
their  performance  theoretically. 

As  a  future  research,  we  intend  to  improve  the  upper 
bound  and  design  better  approximation  algorithms  for  MCDS 


in  geometric  /c-disk  graphs.  Note  that  a  polynomial  time 
approximation  scheme  (PTAS)  exists  for  unit-disk  graphs  [15] 
but  whether  a  PTAS  exists  or  not  for  the  general  geometric 
A;-disk  graphs  is  still  open.  This  is  another  problem  we  intend 
to  target  in  the  future. 
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